The entire framework from converting raw data to preprocessed data usable by ML algorithm, training an ML algorithm, and finally using the output of the ML algorithm to perform actions in the real-world is the Pipeline. We discussed the importance of the Pipeline and the adequate choice of the ML algorithm related to the different feature types, sparsity and amount of data.
45. • Instead of using two coordinates ( 𝒙, 𝒚) to describe
point locaCons, let’s use only one coordinate (𝒛)
• Point’s posiCon is its locaCon along vector 𝒗↓ 𝟏
• How to choose 𝒗↓ 𝟏 ? Minimize reconstruc)on error
SVD – Dimensionality ReducCon
v1
first right
singular vector
Movie 1 rating
Movie2rating