Keywords: NLP, TF-IDF, inverted indexed matrix, web scrawling, word2vec

Description
This project is designed to facilitate student involvement in campus activities. Our team constructs an event search engine for UM Ann Arbor campus that allows users to search for activities of interest and makes corresponding analysis for its performance. The engine crawls information from event websites of different schools and then employs semantic query expansion, usage of the vector space model with tf-idf, BART pretrained model, and query optimization to make a search engine.
This report includes our problem description, followed by a description of our data collection and crawler designs. There is also a description of the evaluation metrics we used and the results. A list of related works, some data samples and output examples are also included.
Report: Please mail to me (hw2894@columbia.edu) or through the mail symbol at the right bottom.