ai-playground

AI POCS: ML, NLP, KAN, Reinforcement Learning, audio, translation, object-detection, GAN, LLM, Vision, Stable Diffusion, Classification, Clustering, XGBoost, NM, RNN, CNN, Pytorch, DJL, DeepLearning4J, ONNX, Keras, Tensorflow, OpenCV, nltk, transformers, transformers.js, sentense-transformers, gradio, chroma, langchain, langchain4j, spring-ai, sklearn, gensim, BERTopic, gymnasium, seaborn, matplotlib, pandas, numpy, scipy, scikit-learn, gradio, chroma, langchain4j, spring-ai, gensim, BERTopic, stable-difussion, anthropic, aws bedrock, cohere, flax, gluonnlp, LM Studio, LocalAI, streamlit, Pinecone. And more... All things AI POCS.

Diego's AI Explained: Making Sense of AI (Essay)

My Essay on AI Explained: Making Sense of AI
https://github.com/diegopacheco/ai-playground/blob/main/AI.Explained.DiegoPacheco.pdf

Diego's Notebooks

Traditional AI

Generative AI

Diego's Hugging Face Demos

Diego's Story Telling Multimodel LLM Gen AI https://huggingface.co/spaces/diegopacheco/gen-ai-multimodel-fun
Diego's Comics Multimodel LLM Gen AI https://huggingface.co/spaces/diegopacheco/genai-llm-multi-model-comics
Diego's LLM Image to Labeled Image https://huggingface.co/spaces/diegopacheco/diego-llm-img-to-lbl
Diego's CrawlPT https://huggingface.co/spaces/diegopacheco/diego-CrawlPT?logs=container
Diego's DialoGPT Chatbot https://huggingface.co/spaces/diegopacheco/diego-llm-chat-DialoGPT?logs=container

Diego's Top POCs

Multi-Model Story Telling AI https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-multi-model-story-telling
Transformers VQA https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-dqa-donut-base-finetuned-docvqa
Llama2 - 7B Unit Test generation https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-CodeGPT-small-java
Transformers - Big Code - Generate and Execute Code https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-bigcode
Transformers Object Detection https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-object-detection
Transformers Pipeline - GPT-like https://github.com/diegopacheco/ai-playground/blob/main/pocs/sentence-transformers-simple-gpt-like/
Sentense Transformers - Image Search by Text https://github.com/diegopacheco/ai-playground/tree/main/pocs/sentence-transformers-image-search
Reinforcement Learning Playing Atari Pacman https://github.com/diegopacheco/ai-playground/tree/main/pocs/reinforcement-learning-Pacman
Transformers.js - Sentiment Analysis React Web App https://github.com/diegopacheco/ai-playground/tree/main/pocs/react-transformers.js-sentiment-analysis/transformers-react
Transformers T5 - Translation https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-T5
Tabular Q&A - Transformers https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-table-question-anwsering-tapas
Transformers DistilBERT - Text Sumarization https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-distilbart-sumarization
Transformers DistilBERT - Q&A https://github.com/diegopacheco/ai-playground/tree/main/pocs/transformers-distilbert-qna
OpenCV - CNN - Sentiment on Image https://github.com/diegopacheco/ai-playground/tree/main/pocs/opencv-sentiment-from-image
OpenCV template matching from images https://github.com/diegopacheco/ai-playground/tree/main/pocs/opencv-template-matching-images
Llama3 - Chatbot and QA 1.5 70B (nvidia) (slow on CPU) https://github.com/diegopacheco/ai-playground/tree/main/pocs/nvidia-llama3-ChatQA-1.5-70B
NLTK - Sentiment Analysis https://github.com/diegopacheco/ai-playground/tree/main/pocs/nltk-sentiment-analysis
Transformers - Llama 3 - 8B Text Generation https://github.com/diegopacheco/ai-playground/tree/main/pocs/llama-3-fun
Langchain4J - OpenSearch / Redis - Document Embeddings and Similarity search (Vector Database) https://github.com/diegopacheco/ai-playground/tree/main/pocs/langchain4j-opensearch https://github.com/diegopacheco/ai-playground/tree/main/pocs/langchain4j-redis
Langchain4J - LocalAI - Documment Enbeddings and Similarity search https://github.com/diegopacheco/ai-playground/tree/main/pocs/langchain4j-local-ai-document-embeddings
Langchain4J - Chatbot with FALCON_7B https://github.com/diegopacheco/ai-playground/tree/main/pocs/langchain4j-chat-FALCON_7B
Keras Speam Classification https://github.com/diegopacheco/ai-playground/tree/main/pocs/keras-classificaiton-spam
Customer2Vec - Customer Segmentation, clustering and product recomendation by similarity https://github.com/diegopacheco/ai-playground/tree/main/pocs/customer-2-vec-dumb
Phi-3-mini-128k Generate Code in Rust https://github.com/diegopacheco/ai-playground/tree/main/pocs/chat-Phi-3-mini-128k-instruct
AWS Bedrock LLAMA3 8B Model - Text Generation https://github.com/diegopacheco/ai-playground/tree/main/pocs/aws-bedrock-llama38B-instruct
ONNX - Portable format for ML models - export tool https://github.com/diegopacheco/ai-playground/tree/main/pocs/ONNX-model-export
DJL - GP2-Text Generation + BERT QA Inference https://github.com/diegopacheco/ai-playground/tree/main/pocs/DJL-gpt2-qa-bert-and-gen-text
DJL - Action Recognition in Image https://github.com/diegopacheco/ai-playground/blob/main/pocs/DJL-action-recognition/
DJL - Hand Written Digit Recognition https://github.com/diegopacheco/ai-playground/tree/main/pocs/DJL-Handwritten-Recognition-MLP
Spring AI + Onnx - Embeddings https://github.com/diegopacheco/ai-playground/tree/main/pocs/spring-ai-ONNX-transformers-all-MiniLM-L6-v2/project
SkLearn Linear Regression - Ridge https://github.com/diegopacheco/ai-playground/tree/main/pocs/sklearn-regression-ridge
Stable Diffusion 3 - Image Generation https://github.com/diegopacheco/ai-playground/tree/main/pocs/stable-diffusion-3

What AI can do?

Traditional AI:

Classification: Classify data into predefined categories.
Regression: Predict continuous values.
Clustering: Group similar data points.
Decision-making: Make decisions based on rules and logic.
Optimization: Find the best solution among options.
Natural Language Processing (NLP): Understand and generate human language (chatbots, sentiment analysis).
Computer Vision: Interpret and understand visual data (image recognition, object detection).
Robotics: Control and interact with physical devices.
Expert Systems: Mimic human expertise in specific domains.
Predictive Maintenance: Predict equipment failures and schedule maintenance.

Generative AI (GenAI):

Text Generation: Create new text, such as articles, stories, or conversations.
Image Generation: Create new images, such as photos, artwork, or designs.
Music Generation: Compose music, melodies, or sound effects.
Video Generation: Create new videos, such as animations or clips.
Data Generation: Create synthetic data for training or testing.
Style Transfer: Transfer styles between images, music, or text.
Image-to-Image Translation: Translate images from one domain to another.
Text-to-Image Synthesis: Generate images from text descriptions.
Dialogue Generation: Engage in conversation, responding to user input.
Creative Writing: Generate creative writing, such as poetry or short stories.

Awesome OSS Libraries

Gynasium: A collection of AI Gym Environments for Reinforcement Learning https://gymnasium.farama.org/

BERTopic is a topic modeling technique that leverages 🤗 transformers and c-TF-IDF to create dense clusters https://github.com/MaartenGr/BERTopic

Gensim: Topic Modeling for Humans, LDA, LSI, HDP, DTM, Word2Vec, FastText, Doc2Vec https://radimrehurek.com/gensim/index.html

SentenseTransformers: Multilingual Sentence Embeddings using BERT / RoBERTa / XLM-RoBERTa & Co. https://sbert.net/

nltk: Natural Language Toolkit, Tokenization, Stemming, Lemmatization, POS Tagging, Named Entity Recognition, etc. https://www.nltk.org/index.html

Keras: High-Level Neural Networks API, Convolutional, Recurrent, Dense, Embedding, etc. https://keras.io/

OpenCV: Open Source Computer Vision Library (Image, Video, Face Detection, Object Detection, Tracking, etc.) https://opencv.org/

PyTorch: ML Framework, Tensors, Datasets, DataLoaders, Models, Optimizers, Loss Functions, etc. https://pytorch.org/

Scikit-Learn: ML Library, Classification, Regression, Clustering, Dimensionality Reduction, Model Selection, Preprocessing, etc. https://scikit-learn.org/stable/

Gradio: Build UIs for your machine learning models https://github.com/gradio-app/gradio

Chroma: the AI-native open-source embedding database https://www.trychroma.com/

Transformers.js: A JavaScript library for running large language models in the browser (on local using ONNX) run Hugging face models in the browser. https://github.com/xenova/transformers.js

LocalAI: LocalAI is a platform that enables you to run large language models on your device (local copy of OpenAI's using Hugging Face's transformers) https://localai.io/

Prompt library https://docs.anthropic.com/claude/prompt-library

ONNX is an open format built to represent machine learning models. ONNX defines a common set of operators - the building blocks of machine learning and deep learning models - and a common file format to enable AI developers to use models with a variety of frameworks, tools, runtimes, and compiler https://onnx.ai/

LangChain: A Decentralized AI Platform for Language Models https://www.langchain.com/

LangChain4J: A Java SDK for LangChain https://github.com/langchain4j/langchain4j

DL4J: Deeplearning4j is an open-source, distributed deep-learning library written for Java and Scala. Integrated with Hadoop and Spark, DL4J is designed to be used in business environments on distributed GPUs and CPUs https://deeplearning4j.konduit.ai/

Deep Java Library (DJL): An open-source, high-level, engine-agnostic Java framework for deep learning built by AWS https://docs.djl.ai/index.html

SpringAI: Spring AI is a Spring project that aims to provide a simple and consistent way to work with AI and ML libraries in the Spring ecosystem https://spring.io/projects/spring-ai

Papers

AWS Recommended

ReAct: Synergizing Reasoning and Acting in Language Models https://arxiv.org/abs/2210.03629

Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network https://arxiv.org/abs/1808.03314

Attention Is All You Need https://arxiv.org/abs/1808.03314

High-Resolution Image Synthesis with Latent Diffusion Models https://arxiv.org/abs/2112.10752

Tree of Thoughts: Deliberate Problem Solving with Large Language Models https://arxiv.org/abs/2305.10601

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks https://arxiv.org/abs/2005.11401

Other industry-based papers

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? https://arxiv.org/abs/2405.05904

Chameleon: Mixed-Modal Early-Fusion Foundation Models https://arxiv.org/pdf/2405.09818

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf

Images that Sound: Composing Images and Sounds on a Single Canvas https://ificl.github.io/images-that-sound/ https://arxiv.org/pdf/2405.12221

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5) https://arxiv.org/pdf/1910.10683

Researchers say their new algorithm trained on a database of TV show Friends clips can detect sarcasm 75% of the time. https://www.popsci.com/technology/sarcasm-algorithm/ https://aclanthology.org/P19-1455.pdf

xLSTM: Extended Long Short-Term Memory https://arxiv.org/pdf/2405.04517

Measuring Massive Multitask Language Understanding (MMMLU) https://arxiv.org/pdf/2009.03300

RAGAS: Automated Evaluation of Retrieval Augmented Generation https://arxiv.org/pdf/2309.15217

From LLM to NMT: Advancing Low-Resource Machine Translation with Claude https://arxiv.org/pdf/2404.13813.pdf

Batch Prompting: Efficient Inference with Large Language Model APIs https://arxiv.org/pdf/2301.08721v1.pdf

The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey https://arxiv.org/abs/2404.11584

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone https://arxiv.org/abs/2404.14219

Automated Discovery of Functional Actual Causes in Complex Environments https://arxiv.org/pdf/2404.10883.pdf

Can Large Language Models Infer Causation from Correlation? https://arxiv.org/abs/2306.05836

RewardBench: Evaluating Reward Models for Language Modeling https://arxiv.org/pdf/2403.13787v1.pdf

Simple and Scalable Strategies to Continually Pre-train Large Language Models https://arxiv.org/abs/2403.08763

Longformer: The Long-Document Transformer https://arxiv.org/pdf/2004.05150.pdf

Efficient Estimation of Word Representations in Vector Space (Word2Vec) https://arxiv.org/pdf/1301.3781.pdf

Code Llama: Open Foundation Models for Code https://arxiv.org/pdf/2308.12950.pdf

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples https://arxiv.org/pdf/2404.07544.pdf

ReFT: Representation Finetuning for Language Models https://arxiv.org/pdf/2404.03592.pdf

OLOMO: Accelerating the Science of Language Models https://arxiv.org/pdf/2402.00838.pdf

FinanceBench: A New Benchmark for Financial Question Answering https://arxiv.org/pdf/2311.11944.pdf

Routerbench: A Benchmark for Multi-LLM Routing System https://arxiv.org/pdf/2403.12031.pdf

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding https://arxiv.org/pdf/1810.04805.pdf

Long-form factuality in large language models https://arxiv.org/pdf/2403.18802.pdf

Is GPT-4 a Good Data Analyst? https://arxiv.org/abs/2305.15038

Chronos: Learning the Language of Time Series https://arxiv.org/abs/2403.07815

Automated Unit Test Improvement using Large Language Models https://arxiv.org/pdf/2402.09171.pdf

Models

Progress Tracker: Open vs. Proprietary LLMs https://huggingface.co/spaces/andrewrreed/closed-vs-open-arena-elo

PaliGemma – Google's Cutting-Edge Open Vision Language Model https://huggingface.co/blog/paligemma

StarCoder2-Instruct: A Large Language Model for Code Generation https://github.com/bigcode-project/starcoder2-self-align

FinGPT: Open-Source Financial Large Language Models https://github.com/AI4Finance-Foundation/FinGPT

LLAMA 2: The Language Model for the Open Web https://huggingface.co/docs/transformers/en/model_doc/llama2

T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer https://huggingface.co/docs/transformers/en/model_doc/t5

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding https://huggingface.co/docs/transformers/en/model_doc/bert

GPT-2: Language Models are Unsupervised Multitask Learners https://huggingface.co/docs/transformers/en/model_doc/gpt2

Posts

Neural Network Zoo https://www.asimovinstitute.org/neural-network-zoo/

Solve puzzles. Improve your pytorch. https://github.com/srush/Tensor-Puzzles

llama3 implementation one matrix multiplication at a time https://github.com/naklecha/llama3-from-scratch

History of the Generative AI https://medium.com/@glegoux/history-of-the-generative-ai-aa1aa7c63f3c

Convolutional Neural Network (CNN): A Complete Guide https://learnopencv.com/understanding-convolutional-neural-networks-cnn

Stable Diffusion 3: Research Paper https://stability.ai/news/stable-diffusion-3-research-paper

OWASP Top 10 for Large Language Model Applications https://owasp.org/www-project-top-10-for-large-language-model-applications/

Videos

Generative AI in a Nutshell https://www.youtube.com/watch?v=2IK3DFHRFfw&ab_channel=HenrikKniberg

Building Production-Ready RAG Applications https://www.youtube.com/watch?v=TRjq7t2Ms5I&ab_channel=AIEngineer

All Learning Algorithms Explained in 14 Minutes https://www.youtube.com/watch?v=BT6Aw6Q75Yg&ab_channel=CinemaGuess

3Blue1Brown: Neural Networks and Deep Learning animated math https://www.3blue1brown.com/topics/neural-networks

Let's build GPT: from scratch, in code, spelled out. https://www.youtube.com/watch?v=kCc8FmEb1nY&ab_channel=AndrejKarpathy

ML on Kubernetes

KServe https://github.com/kserve/kserve

Kubeflow https://www.kubeflow.org/

MLOperator https://github.com/polyaxon/mloperator

Interesting Solutions

The Best LLM on Every Prompt (LLM Routiung) https://unify.ai/

Vectara (RaG as a Service) https://vectara.com/

Name		Name	Last commit message	Last commit date
Latest commit History 549 Commits
concepts		concepts
papers-summary		papers-summary
pocs		pocs
.gitignore		.gitignore
AI.Explained.DiegoPacheco.pdf		AI.Explained.DiegoPacheco.pdf
LICENSE		LICENSE
README.md		README.md
create-ai-python3-project.sh		create-ai-python3-project.sh
install-ai-dev-tools.sh		install-ai-dev-tools.sh
run-local-ai.sh		run-local-ai.sh

License

diegopacheco/ai-playground

Folders and files

Latest commit

History

Repository files navigation

ai-playground

Diego's AI Explained: Making Sense of AI (Essay)

Diego's Notebooks

Diego's Hugging Face Demos

Diego's Top POCs

What AI can do?

Awesome OSS Libraries

Papers

AWS Recommended

Other industry-based papers

Models

Posts

Videos

ML on Kubernetes

Interesting Solutions

About

Topics

Resources

License

Stars

Watchers

Forks

Languages