Skip to content
Snippets Groups Projects
rawal's avatar
rawal authored
3999de96
History

PGM Exam 2022

Getting started

You can run the code from

  1. Notebook: Provided in this repo or in colab using:

https://colab.research.google.com/drive/1khqHZc-m3qx2ZO7XjsRc3KeHhOuS2gTV?usp=sharing

  1. Python file
python main.py

Results

The results were generated using colab with tfidf_features in function main set to 10000 and it took about 30 minutes. For fast execution you may set it to 2000, hence it is set as default. But setting it to 2000 will only give result for cosine similarity in PCA for LDA it has to be higher.

Results where the topics are identified between original and modified dataset:

Failed experiments results where topics are not identified between original and modified dataset: