llm-security

Here are 36 public repositories matching this topic...

lastlayer / last-layer-vercel

Example of running last_layer with FastAPI on vercel

llm-security llm-privacy llm-guard llm-guardrails

Updated Apr 5, 2024
Python

nodite / llm-guard-ts

The Security Toolkit for LLM Interactions (TS version)

typescript transformers security-tools adversarial-machine-learning large-language-models llm prompt-engineering chatgpt llmops prompt-injection llm-security

Updated Jan 5, 2024

CyberAlbSecOP / MINOTAUR_Impossible_GPT_Security_Challenge

MINOTAUR: The STRONGEST Secure Prompt EVER! Prompt Security Challenge, Impossible GPT Security, Prompts Cybersecurity, Prompting Vulnerabilities, FlowGPT, Secure Prompting, Secure LLMs, Prompt Hacker, Cutting-edge Ai Security, Unbreakable GPT Agent, Anti GPT Leak, System Prompt Security.

cyber-security security-challenge ai-security prompt-engineering prompt-injection gpt-security llm-security ai-jailbreak ai-jailbreak-prompts prompt-security system-prompt super-prompt prompt-security-challenge ai-cyber-security gpts-security flow-gpt

Updated Mar 27, 2024

awesome-software / llm-attacks

Universal and Transferable Attacks on Aligned Language Models

llm-security

Updated Sep 19, 2023
Python

matthernet / LLM-security-check

CLI tool that uses the Lakera API to perform security checks in LLM inputs

ai artificial-intelligence ai-security large-language-models llm llm-security

Updated Mar 13, 2024
Python

rohilrg / CatchPromptInjection

This repo focuses on how to deal with the prompt injection problem faced by LLMs.

openai-api transformers-models llm langchain prompt-injection llm-security

Updated Oct 19, 2023
Python

pdparchitect / llm-hacking-database

This repository contains various attacks against Large Language Models.

security hacking llm llm-security

Updated Apr 16, 2024

mickymultani / TestingGemma2B

Evaluation of Google's Instruction Tuned Gemma-2B, an open-source Large Language Model (LLM). Aimed at understanding the breadth of the model's knowledge, its reasoning capabilities, and adherence to ethical guardrails, this project presents a systematic assessment across a diverse array of domains.

gemma responsible-ai huggingface-transformers llm llms llmops genai llm-security llm-inference genai-usecase largelanguagemodels gemma-2b

Updated Feb 26, 2024
Jupyter Notebook

AiShieldsOrg / AiShieldsWeb

AiShields is an open-source Artificial Intelligence Data Input and Output Sanitizer

ai application-security appsec sensitive-data-security data-security ai-security aisec applicationsecurity llm prompt-engineering aisecurity llm-security llmsecurity llmsec prompt-injection-remediation model-denial-of-service-remediation insecure-output-handling-remediation overreliance-remediation prompt-engineering-security artificial-intelligence-security

Updated May 9, 2024
Python

azminewasi / Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

pretrained-models pretrained-weights pretrained-language-model large-language-models llm llms llmops large-language-model llm-serving llm-prompting llm-agent llm-security llm-training llm-inference llm-framework llm-privacy llm-evaluation large-language-models-for-graph-learning large-language-models-and-translation-systems

Updated Apr 4, 2024

balavenkatesh3322 / guardrails-demo

LLM Security Project with Llama Guard

security attack-defense llm aisecurity generative-ai llmops llm-security llama-2 prompt-injection-tool llama-guard

Updated Feb 18, 2024
Python

awesome-software / llm-guard

The Security Toolkit for LLM Interactions

llm-security

Updated Sep 21, 2023
Python

lakeraai / chainguard

Guard your LangChain applications against prompt injection with Lakera ChainGuard.

llm langchain prompt-injection langchain-python llm-security

Updated Apr 17, 2024
Python

arekusandr / last_layer

Ultra-fast, low latency LLM prompt injection/jailbreak detection ⛓️

jailbreak security-tools large-language-models prompt-engineering chatgpt-prompts llm-security llm-local llm-guard llm-guardrails

Updated May 9, 2024
Python

microsoft / BIPIA

A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.

llm-security

Updated Apr 15, 2024
Python

M507 / HackMeGPT

Vulnerable LLM Application

gandalf security-tools damn-vulnerable prompt-engineering prompt-injection llm-security jailbreak-prompt vulnerable-llm-application

Updated Jan 1, 2024
Python

llm-platform-security / SecGPT

SecGPT: An execution isolation architecture for LLM-based systems

sandbox gpt isolation multi-agent-systems openai-api llm chatgpt langchain llm-agent llm-security llm-framework llm-privacy llm-platform llm-based-systems

Updated Apr 29, 2024
Python

LostOxygen / llm-confidentiality

Whispers in the Machine: Confidentiality in LLM-integrated Systems

security machine-learning framework deep-learning transformers openai prompt-toolkit gpt confidentiality systems-security llm prompt-engineering chatgpt prompt-injection llm-security

Updated Apr 24, 2024
Python

levitation-opensource / Manipulative-Expression-Recognition

MER is software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. MER benchmarks language models for manipulative expressions, fostering the development of transparency and safety in AI. It also supports manipulation victims by detecting manipulative patterns in human communication.

benchmarking sentiment-analysis manipulation transparency fraud-prevention human-computer-interaction human-robot-interaction expression-recognition sentiment-classification fraud-detection psychometrics misinformation conversation-analysis conversation-analytics llm prompt-engineering prompt-injection llm-security llm-training llm-test

Updated Jan 31, 2024
HTML

lakeraai / pint-benchmark

A benchmark for prompt injection detection systems.

benchmark llm prompt-injection llm-security llm-benchmarking

Updated May 7, 2024
Jupyter Notebook
