一个 Golang 实现的相对智能、无需规则维护的通用新闻网站数据提取工具库。含域名探测、网页编码语种识别、网页链接分类提取、网页新闻要素抽取以及新闻正文抽取等组件。
-
Updated
Feb 19, 2024 - Go
一个 Golang 实现的相对智能、无需规则维护的通用新闻网站数据提取工具库。含域名探测、网页编码语种识别、网页链接分类提取、网页新闻要素抽取以及新闻正文抽取等组件。
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
A fully customisable language detection pipeline for spaCy
The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in NLP algorithms, neural architectures, and distributed machine learning systems. The content is based on our past and potential future engagements with customers as well as collaboration with partners, researchers, and the open sourc…
Detect the language from the given sentence
Personalized anime recommendations based on collaborative filtering. Discover your next favorite anime!
Plateforme de Connaissances Unifiées (PCU) project (i.e Unified Knowledge Platform)
This my mini-projects that you may be interested in doing too... Enjoy!!
Crawled only the Bengali comments from cricket news of Bangladeshi newspaper Prothom Alo.
Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )
To 1) create train/test samples of Tatoeba sentences for NLP-related tasks & 2) evaluate the performance of different solutions for detecting the language of a given text.
A Julia package for language identification.
The repository aimed at providing user a functionality of extracting the possible meaningful titles from the mentioned sentence.
extracts and translates foreign language inside images
A vocal assistant for a university reception that responds to certain topics related to the administration with both languages English and French.
Conversate effortlessly in more than 50 languages!
Elasticsearch JS plugin for langdetect module
Add a description, image, and links to the langdetect topic page so that developers can more easily learn about it.
To associate your repository with the langdetect topic, visit your repo's landing page and select "manage topics."