A parser for annotated MuseScore 3 files.
-
Updated
May 23, 2024 - Python
A parser for annotated MuseScore 3 files.
📑 Galician corpus for misogyny detection
Radio Audio Corpus Collection Toolkit with Hackrf One.
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Thai News Dataset from Thai government website.
❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
🚁 保险行业语料库,聊天机器人
ParlaMint: Comparable Parliamentary Corpora
DHARMA project Task Force B, Bhaumakara epigraphic corpus.
Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition
A very simple news crawler with a funny name
ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.
This repository contains material for a master thesis' project at the University of Pavia: "Automatic Implicit Object completion in Italian: an exploration with BERT"
Extracting character conversations in Genshin Project
Add a description, image, and links to the corpus topic page so that developers can more easily learn about it.
To associate your repository with the corpus topic, visit your repo's landing page and select "manage topics."