Clustering similar articles together

Many research articles have been published on Covid-19. How can one quickly find the right article? Using Natural Language Processing (NLP) techniques, thousands of text articles are converted into a multidimensional matrix. Matrix factorization techniques such as PCA, ICA, and LDA are then applied to identify clusters of similar articles. The top keywords from each cluster are inspected to check that the grouping makes sense.

The full IPython notebook is available in the GitHub repository (link below).

« Projects Github »