Using John Snow Labs NLP Pipeline and TF-IDF to find document matches with a single keyword query.
Keywords should be in a single input file.
Movie plot summaries is first input document, and then later a document containing movie information (title, year, etc) is joined.
Program will returns top 10 movie plot summary matches for a single keyword.
Code can be run on DataBricks notebook.