Wednesday, July 22, 2015
 



4:00pm to 6:00pm
LISA Statistics Short Course: Multivariate Clustering in R (Academic)

LISA SHORT COURSES IN STATISTICS
LISA (Virginia Tech's Laboratory for Interdisciplinary Statistical Analysis) is providing a series of evening short courses to help graduate students use statistics in their research. The focus of these two-hour courses is on teaching practical statistical techniques for analyzing or collecting data. See www.lisa.stat.vt.edu/?q=short_courses for instructions on how to REGISTER and to learn more.

Summer 2015 Schedule:
Wednesday, June 24: Designing Experiments;
Wednesday, July 1: Basics of R;
Wednesday, July 8: Generalized Linear Models (GLMs) and Categorical Data Analysis (CDA);
Wednesday, July 15: Graphics in R;
Wednesday, July 22: Multivariate Clustering in R;
Wednesday, July 29: Sample Size Calculations;
Wednesday, August 5: Using mixed effects models to quantify dependency among repeated measures;


Wednesday, July 22, 4:00-6:00 pm;
Location: 1080 Torgersen Hall;
Instructor: Yuhyun Song;
Title: Multivariate Clustering in R;

Multivariate analysis in statistics is a set of methods for analyzing data when more than one variable is under consideration. Multivariate analysis techniques may be used for several purposes, such as dimension reduction, clustering, or classification. The primary goal of this short course is to help researchers who want to understand multivariate data and explore multivariate analysis tools.

In this course, we will briefly discuss general multivariate analysis and then concentrate on clustering techniques. The goal of clustering analysis is to establish a set of meaningful groups of similar objects by investigating the relationships between objects. For example, if you have customer data, you may segment customers into clusters based on their buying habits and demographic characteristics, then use the clustering results to tailor your marketing efforts. We will explore two popular clustering techniques: agglomerative hierarchical clustering and the K-means clustering algorithm. We will also discuss how to choose the number of clusters and how to visualize the clustering solutions. R software will be used in this course.
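
As a rough illustration of the K-means idea mentioned above, here is a minimal sketch. It is written in Python rather than R so that it runs with only the standard library, and it is not taken from the course materials. The function name `kmeans`, the toy points, and the choice of initial centers are all assumptions made for illustration.

```python
import math

def kmeans(points, centers, max_iter=100):
    """Illustrative K-means (Lloyd's algorithm) on tuples of floats.

    `centers` holds the initial cluster centers; here they are chosen
    by hand, which is an assumption of this sketch.
    """
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        labels = [
            min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for j in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                mean = tuple(sum(coord) / len(members) for coord in zip(*members))
                new_centers.append(mean)
            else:
                # Keep an empty cluster's center where it was.
                new_centers.append(centers[j])
        if new_centers == centers:
            break  # Converged: the centers no longer move.
        centers = new_centers
    return labels, centers

# Hypothetical toy data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
labels, centers = kmeans(points, centers=[points[0], points[3]])
print(labels)
print(centers)
```

With real data, results depend on the starting centers, so it is common to run the algorithm from several random starts and keep the best solution.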

This course covers:
1. What is clustering analysis? Why is clustering analysis important?
2. Agglomerative hierarchical clustering algorithm
3. K-means clustering algorithm
4. How do we choose the number of clusters?
5. How to visualize the clustering solutions
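
To give a flavor of topic 2, the sketch below shows one simple form of agglomerative hierarchical clustering. It is written in Python rather than R and is not the course's own code. The function name `agglomerative`, the use of single linkage (distance between the closest pair of points), and the toy data are assumptions chosen for illustration; other linkage rules are equally valid.

```python
import math

def agglomerative(points, n_clusters):
    """Illustrative single-linkage agglomerative clustering.

    Starts with every point in its own cluster and repeatedly merges
    the two closest clusters until `n_clusters` remain. Returns the
    clusters as sorted lists of point indices.
    """
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        best = None
        # Find the pair of clusters with the smallest linkage distance.
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = linkage(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        # Merge cluster y into cluster x.
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]

# Hypothetical toy data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
clusters = agglomerative(points, n_clusters=2)
print(clusters)
```

Here the number of clusters is supplied by the caller. In practice it is often chosen by inspecting the merge distances, for example on a dendrogram, and stopping before a large jump.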

Data Set:
The data set can be downloaded at http://archive.ics.uci.edu/ml/datasets/Wine. It includes 178 wines grown in the same region in Italy. Thirteen attributes, all results of chemical analysis, were measured for each wine. We will use this data set to explore the clustering algorithms.

The graph below shows the clustering results by the K-means clustering method.
www.lisa.stat.vt.edu/sites/default/files/images/K-means-clustering.png

Follow us on Facebook (www.facebook.com/Statistical.collaboration) or Twitter (www.twitter.com/LISA_VT) to be the first to know about LISA events!


Location: 1080 Torgersen Hall
Price: Free
Contact: Tonya Pruitt
E-Mail: lisa@vt.edu
540-231-8354
   