Project Description
This project focuses on fine-tune a distilbert model to predict news categories using only news headline. For model demo and downloading the model, please check my HuggingFace Repo🤗. HuggingFace Repository
Data Description
The data is from Kaggle.
There are 200, 000 rows and 42 cotagories in our predict column.
Model Training
To transform text data into vetors, I first applied TfidfVectorizer to preproess text data.
|
|
Project Detail
Please refer to this PDF to check the project details. Project PDF
For more details about the code of the project, please refer to my GitHub Repository