16. About Me
B.A., Mount Holyoke College
Major: Mathematics
Minor: Computer Science
Education
Interests Reading
Art History
Hiking
Stephanie Stark
17. Future Work
Implement TF/IDF compatibility for project
Use PCA
Implement cosine similarity for feature
clustering
Cluster within metro area
Use Redis as a cache for feature vectors
Editor's Notes
Some picture relating to actual project … maybe 3 or 4 related to each other?? Music + choices.
Pictures???
advantage of using kafka for just batch. EXPLAIN or delete!!!!! Add in HDFS.
vector in n-dimensional space for better visualization
vector in n-dimensional space for better visualization
vector in n-dimensional space for better visualization
explain what songkick, echonest, freq, & wt ; size of data
explain what songkick, echonest, freq, & wt ; size of data
advantage of using kafka for just batch. EXPLAIN or delete!!!!! Add in HDFS.
Talk about trying to implement PCA… ugh
what is partition key, clustering column, why??
advantage of using kafka for just batch. EXPLAIN or delete!!!!! Add in HDFS.