Identify News Category Based on News Headlines (NLP)

Project Description This project focuses on fine-tune a distilbert model to predict news categories using only news headline. For model demo and downloading the model, please check my HuggingFace Repo🤗. HuggingFace Repository HuggingFace Demo Data Description The data is from Kaggle. There are 200, 000 rows and 42 cotagories in our predict column. Model Training Input preprocessing To transform text data into vetors, I first applied TfidfVectorizer to preproess text data....

Restaurant Type Prediction (NLP)

Project Description Kaggle Competition Page The dataset contains details about restaurants and their reviews. The goal of this kaggle competition is to design data mining models to predict the restaurant type using the observed variables. This is a challenge designed for Master of Business Analytics students at the Rady School of Management, University of California, San Diego. This competition is also part of the course requirements for MGTA 415 Analyzing Unstructured Data....