Movie Reviews Summarization

Project information

  • Course: Big Data Management and Analytics
  • Technology Used: Scala, IntelliJ IDEA, AWS S3
  • Project date: Aug 2019 - Dec 2019
  • Project URL:

Developed K-means and Hierarchical supervised clustering techniques to output positive and negative review summary for each movie per algorithm along with their TF-IDF values in Scala. Compared the the output of two models to derive various conclusion .