Generative AI
Abbreviations:
Uf = user friendly; can be used right away, no installation required
Nf = not user friendly; requires installation
Lf = Free Limited Trial
Api = requires an API key to use
st: = main features and structures of the generative AI
dark red color, such as "text-to-image" = the type of generative AI, or type of system
********** = my preference
a16z-infra
AI-Town (August 6, 2023) - Agent Community
- GitHub
- Generative Agents: Interactive Simulacra of Human Behavior
- Inside AI Town: What AI Can Teach Us About Being Human
- AI Town
Companion-App (July 11, 2023) - Chatbot (Uf as an AI companion)
- GitHub
Adobe
MAX Sneaks - GS3 (Oct 11, 2023) - image - video
Firefly (Beta, March 2023) - text-to-image
AGIResearch
OpenAGI (April 10, 2023) - Chatbot
- st: LLM + Domain Experts
AIAgent
AiAgent.app (Beta, May 2023) - Automating Tasks (Uf, Lf)
- st: LLM, AutoGPT/BabyAGI based
- Brief Introduction: AiAgent.app: Create Autonomous Ai Agents In YOUR BROWSER!
AI21 Labs
Jurassic-2 (March, 2023) - LLM
- AI21 Studio (try the Playground, Uf)
Anthropic
Claude 2 (July 11, 2023) - Chatbot - USA and UK only
- New FREE AI Beats ChatGPT Code Interpreter | Claude 2 AI
Claude Pro (Sept 7, 2023) - Chatbot - USA and UK only
Black-forest-labs
Flux.1 (Aug 7, 2024) - Text-to-Image (Nf)
- GitHub
- Opensource, Uncensored, Unbothered. - Flux.1 Image Gen
Ehartford
Wizard-Vicuna-30B-Uncensored GPTQ (June 2023) - Chatbot (Nf)
- Important Notice: An uncensored model has no guardrails. You are responsible for anything you do with the model, just as you are responsible for anything you do with any dangerous object such as a knife, gun, lighter, or car. - More
- Fully Uncensored GPT Is Here 🚨 Use With EXTREME Caution
ElevenLabs
ElevenLabs (2022) - Text-to-Speech (Uf)
Eric Hartford
Dolphin (July 2023) - Dataset
- An open-source, uncensored, commercially licensed dataset and series of instruct-tuned language models based on Microsoft's Orca paper.
Facebook Meta AI
LLAMA 2 (July 18, 2023) - LLM
- Hugging Face: Llama-2-70b-chat-hf
- Demo: Llama2 70B Chatbot (Uf)
LIMA (Less Is More for Alignment, May 18, 2023) - LLM
Voicebox (research paper, June 19, 2023) - Speech Generator
GeeKan
MetaGPT (Aug 7, 2023) - Multi-Agent Framework (Nf)
- Paper "METAGPT: META PROGRAMMING FOR MULTI-AGENT COLLABORATIVE FRAMEWORK"
- MetaGPT: Multi-Agent AI Programming Framework
Google
Gemini 1.5 Pro (Feb 15, 2024) - Multimodal
- Paper
- Our next-generation model: Gemini 1.5
- Mixtral of Experts (MoE)
- GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
- KOL Explanation 1: Gemini 1.5 and The Biggest Night in AI
- KOL Explanation 2: Google GEMINI 1.5 Capabilities SHOCKED everyone!
Gemini (Dec. 6, 2023) - Multimodal
- Introducing Gemini
- video illustrations
- Interacting with Gemini through multimodal prompting
- Matthew Berman: GEMINI Beats GPT4!! Google's New Gemini Model Is INSANE
Duet AI (Oct 31, 2023) - AI Assistance
Bard (March 2023) - Chatbot (Uf)
- st: based on LaMDA, LLM
- available in many countries
PaLM 2 (May 10, 2023) - LLM
SoundStorm (May 16, 2023) - Text-to-Speech
Haotian Liu
LLaVA-1.5 (Oct 5, 2023) - Text - Image
- Demo
- GitHub
- Paper: Improved Baselines with Visual Instruction Tuning
Ideogram
Introducing Ideogram 2.0 (Aug 21, 2024) - Text-to-Image
Imartinez
PrivateGPT 2.0 (Nov 9, 2023) - Offline Chatbot about your local docs. (Nf)
- PrivateGPT 2.0 - FULLY LOCAL Chat With Docs (PDF, TXT, HTML, PPTX, DOCX, and more)
PrivateGPT (May 13, 2023) - Offline Chatbot about your local docs. (Nf)
- st: LLM
Leonardo Interactive Pty
Leonardo.Ai (April, 2023) - Text-to-Image (Uf)
Microsoft
TinyTroupe (Nov 11, 2024) - Multi-Agent Persona Simulation (Nf)
- GitHub
- TinyTroupe: A library to simulate Agents for productivity and business scenarios
AutoGen (Sept 26, 2023) - Multi-Agent Framework (Nf)
- You can define and make your own agents to work for you.
- AutoGen Studio (Dec 1, 2023)
Vall-E X (Sept 10, 2023) - Multilingual Text-to-Synthesized Speech
- Demo
- Microsoft’s NEW CREEPY AI ‘VALL-E X’ Shocks The Entire Industry!
Kosmos-2 (June 27, 2023) - Multimodal LLM
- "This work lays out the foundation for the development of Embodiment AI and sheds light on the big convergence of language, multimodal perception, action, and world modeling, which is a key step toward artificial general intelligence." (From the "Abstract" of the paper)
- paper: "Kosmos-2: Grounding Multimodal Large Language Models to the World"
Gorilla (May 2023) - LLM, LLaMA-based (Nf)
- Developed with UC Berkeley
- Covers 1,645 API calls (uses tools effectively); zero-shot; considerably reduces hallucination
- Paper
- GitHub
- Why This Is AGI, Explained In 2 Minutes
Microsoft Bing - Search Engine (Uf) *********
HuggingGPT (March 2023) - Chatbot (Uf, Api)
- st: AGI, LLM
MidJourney
MidJourney - text-to-image (Uf, Lf)
Mistralai
Mistral 7B (Oct 10, 2023) - LLM
- Paper
- GitHub
- Demo: fw-mistral-7b
MosaicML
MPT-30B-Chat (June 22, 2023) - Chatbot (Nf)
- st: open source LLM
- MPT-30B LLM
- MPT-30B-Instruct GGML
- Demo: MosaicML MPT-30B-Chat (Uf) **********
MultiOn
Agent Q (Aug 13, 2024) - Agent (Nf)
Next++
NExT-GPT (Sept 13, 2023) - Any-to-Any Multimodal LLM
- Paper
- GitHub
- Company
Nlpxucan
WizardCoder - 34B (June 1, 2023 update licenses) - LLM
- st: Code Llama
- GitHub
- Hugging Face
Nous Research
Nous Chat (Nov 12, 2024) - Chatbot (Uf)
- Chatbot
OpenAI
ChatGPT Search (Oct 31, 2024)
- Introducing
- The New Era of Search Begins
OpenAI o1 (Sept 12, 2024)
- Introducing OpenAI o1-preview
- Learning to Reason with LLMs
- OpenAI Releases GPT Strawberry 🍓
- Open AI SHIPS: "GPT o1" First Look!
- Can ChatGPT o1-preview Solve PhD-level Physics Textbook Problems?
- OpenAI o1 w/ 3 Logic Tests & QCD Feynman Integral
SearchGPT
- Testing a temporary prototype (July 25, 2024)
- OpenAI's New SearchGPT Shakes Up the Industry
GPT-4o (May 13, 2024)
- Official
- GPT4o: 11 STUNNING Use Cases and Full Breakdown
GPT List
- Official
- 15 Custom GPTS That Will Change How You Work!
ChatGPT (Nov, 2022) - Chatbot (Uf) *********
- st: LLM
- Custom instructions for ChatGPT (July 20, 2023)
- ChatGPT Enterprise (August 28, 2023)
- ChatGPT can See, Hear and Speak (Sept 25, 2023)
- More Links...
DALL·E 2 (Nov. 2022) - text-to-image (Uf) **********
Auto-GPT (March 2023) - Automating Tasks (Nf)
- st: LLM
Code Interpreter (Alpha, March, 2023) - Chatbot (Nf)
- ChatGPT plugins
- ChatGPT Code Interpreter AI Plugin Demo
- Trying out Code Interpreter for ChatGPT
DALL·E 3 (announced Sept 2023: coming soon)
OpenBMB
ChatDev (Sept 26, 2023) - Multi-Agent Framework (Nf)
- st: LLM
OpenGVLab
Ask-Anything (April 28, 2023) - Video Recognition Chatbot
- st: ChatGPT, MiniGPT-4
- YouTube Introduction: Ask-Anything: A Video-To-Text Chatbot using ChatGPT
Phind Technologies
Phind (Nov. 2023) - AI Search Engine (Uf) **************
- Description
Quora
Poe AI (Feb 6, 2023) - Chatbot
- Uf
- st: ChatGPT-4, Llama 2
- Quora opens its new AI chatbot app Poe to the general public
- A Glimpse into Our Future: Highly Customized AI Bots
Rabbit
R1 (Jan 9, 2024) - Large Action Model (LAM)
- Rabbit R1: NEW Personal AI AGENT Device NO ONE Saw Coming
Reworkd AI
AgentGPT (April 19, 2023) - Automating Tasks (Uf) ***********
- st: LLM
Roblox
Roblox Assistant (Sept 8, 2023) - Text-to-Virtual Reality
- Official: Revolutionizing Creation on Roblox with Generative AI
- Roblox’s new AI chatbot will help you build virtual worlds
Sakana.Ai
AI Scientist (Aug 13, 2024) - Scientific Paper Generator
Sourcell
DoctorGPT (Aug 12, 2023) - Chatbot (Nf)
- st: Llama 2
- GitHub
- DoctorGPT: Offline & Passes Medical Exams!
Stability AI
Stable Diffusion (2022) - text-to-image (Uf)
StableLM 3B (April 2023) - Alpha version, LLM
StableVicuna (April 28, 2023) - Chatbot (Uf)
- Stability AI releases StableVicuna, the AI World’s First Open Source RLHF LLM Chatbot
- st: LLM, Open Source
SDXL 0.9 (June 22, 2023) - text-to-image
- Demo of SDXL 0.9 (Uf)
- Open-Source and FREE AI Generator SDXL Is Stunning (Comparison)
Stanford University