Leon Derczynski leondz

🏗️

vibing

prof / scientist in cs / nlp / ml

219 followers · 23 following

leondz/README.md

Hi there 👋

🔭 I research natural language processing and machine learning. I'm currently looking at:
- 🔒 LLM security: hazards manifest if we don't treat language models as unreliable and subvertible. Or as demons!
- 🛡️ Online harms: content safety, misinformation processing, hate speech & abusive language detection. Enumerate risks with Language Model Risk Cards

🏢 I'm a principal research scientist at NVIDIA by day and principal investigator of Strømberg NLP at ITU Copenhagen by night
🧑‍🎓 I’m still learning sizecoding
🎓 My research papers are on Google Scholar. Ask me about any of them!
🪶 I write NLP, machine learning, and language tech articles on my blog, Inter Human Agreement

Pinned

- garak: LLM vulnerability scanner. Python · 861 stars · 100 forks
- hatespeechdata: Catalog of abusive language data (PLoS 2020). Python · 290 stars · 73 forks
- lm_risk_cards: Risks and targets for assessing LLMs & LLM vulnerabilities. Python · 20 stars · 5 forks
- emerging_entities_17: Dataset for the Emerging & Novel Entity NER task (WNUT '17). 110 stars · 24 forks
- GateNLP/broad_twitter_corpus: The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016). Jupyter Notebook · 63 stars · 6 forks
- generalised-brown (forked from sean-chester/generalised-brown): C++ implementation of Generalised Brown clustering and python scripts for feature generation (AAAI 2016). C++ · 2