tfidfvectorizer

Here are 48 public repositories matching this topic...

Phony News Classifier is a repository containing an analysis of a natural language processing application: a fake news classifier built with various text preprocessing strategies (bag of words, TF-IDF vectorizer, lemmatization, stemming), Naive Bayes, and deep learning RNN (LSTM) models, with the detailed accuracy of each recorded below

  • Updated May 27, 2020
  • Jupyter Notebook

For our final project, our group chose a Kaggle dataset containing medical transcriptions and their respective medical specialties (4998 data points). We chose to implement multiple supervised classification machine learning models - after extensive work on the corpora - to see if we were able to correctly classify the medical sp…

  • Updated May 13, 2021
  • Jupyter Notebook

Data consists of tweets scraped using the Twitter API. The objective is sentiment labelling using a lexicon approach, text pre-processing (such as language detection, tokenisation, normalisation, vectorisation), and building pipelines of text classification models for sentiment analysis, followed by explainability of the final classifier

  • Updated Apr 3, 2022
  • Jupyter Notebook
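
The repositories listed here all rely on TF-IDF vectorisation, so a minimal sketch of the weighting may help readers new to the topic. It is not taken from any of the listed projects. It assumes the smoothed IDF formula ln((1 + n) / (1 + df)) + 1 followed by L2 row normalisation, which matches the documented defaults of scikit-learn's `TfidfVectorizer`. The function name `tfidf`, the sample documents, and the lowercase whitespace tokenisation are illustrative assumptions (scikit-learn's default tokeniser behaves differently).

```python
import math
from collections import Counter


def tfidf(docs):
    """Return (vocabulary, rows) of L2-normalised tf-idf weights.

    Hypothetical helper for illustration; tokenisation is a plain
    lowercase whitespace split, which is a simplifying assumption.
    """
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    n_docs = len(tokenised)

    # Document frequency: number of documents containing each term.
    df = Counter(tok for toks in tokenised for tok in set(toks))

    # Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}

    rows = []
    for toks in tokenised:
        tf = Counter(toks)  # raw term counts in this document
        weights = [tf[tok] * idf[tok] for tok in vocab]
        # L2-normalise the row; guard against an empty document.
        norm = math.sqrt(sum(w * w for w in weights)) or 1.0
        rows.append([w / norm for w in weights])
    return vocab, rows


# Made-up sample documents, not drawn from any listed dataset.
docs = ["fake news spreads fast", "real news spreads slowly", "fake fake claims"]
vocab, rows = tfidf(docs)
```

In the first document, the term "fast" appears in only one document and so receives a higher weight than "news", which appears in two. A classifier such as Naive Bayes would then be trained on rows like these.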
