Data Mining, Analytics, and Information Extraction in Intelligent Business Services: Online Ads, Healthcare, and Service Centers |
Date |
Topic |
Related Techniques |
Required Readings |
Optional Readings |
Homework |
Guest Speakers |
Lecture 1 Jan 19 |
Introduction |
|
|
|
|
|
Lecture 2 Jan 26 |
Finacial Services Pridiction of House Prices Health Services: Weight Prediction |
-Linear Prediction - Bayesian Logistic Regression (if time permits) |
TSK Ch 4 |
LGC Ch 3 Prediction methods for babies' birth weight using linear and nonlinear regression analysis (Etikan and Kazim) |
|
|
Lecture 3 Feb 2 |
Finacial Services: Fraud Detection Online Purchase Probabilities
|
- Linear Prediction - Logistic Regression
|
LGC Ch 4 |
|
|
|
Lecture 4 Feb 9 |
Lecture 3
|
- Logistic Regression |
|
|
|
|
Lecture 5 Feb 16 |
Loan Approval Social Networks Eigenrumor Detection Health Services: Cancer Identification |
- Nearest Neighbor - Naïve Bayes |
|
|
|
|
Lecture 6 Feb 23 |
Lecture 4 CrowdScience Project |
- Naïve Bayes |
|
|
CrowdScience.com |
|
Lecture 7 Mar 2 |
Jenny Presentation Why Naïve Bayes works |
- Naïve Bayes |
|
|
|
|
Lecture 8 Mar 9 |
Text Mining using SVD Search Engines Claritics Research Project |
- Indexing - Space based search engine - SVD - LSI |
|
Probabilistic Principal Component Analysis (Tipin & Bishop) Sensitivity of PCA to Traffic Anomaly Detection (Ringberb) Using Latent Semantic Indexing to Filter Spam Topic Identification with soft clustering using PCA and ICA (Zhukouv) |
|
LawPivot.com |
Lecture 9 Mar 16 |
Market Segmentation |
- Clustering |
|
|
|
Claritics.com |
Mar 23 |
Spring Break |
|
|
|
|
|
Lecture 10 Mar 30 |
(attribute to William Cohen, CMU) Entity Extraction |
- Introduction - Named Entities Recognition (NER) |
Information Extraction, S. Sarawagi, FnT Databases, 1(3), 2008. Information Extraction: Distilling Structured Data from Unstructured Text, McCallum, ACM Queue 2005. |
|
|
- Introduction - Named Entities Recognition (NER) |
Lecture 11 Apr 6 |
Recommender System |
Item-based User-based Hybrid Nearest Neighbor |
|
Factor in the Neighbors: Scalable andAccurate Collaborative Filtering |
|
|
Lecture 12 Apr 13 |
Recommender System |
Latent Semantic Analysis Recommender System Collaborative Filtering |
|
|
|
|
Lecture 13 Apr 20 |
|
|
|
|
|
|
Lecture 14 Apr 27 |
|
|
|
|
|
|
|
|
|
|
|
|
|