Search

Find trending repositories by name or description.

100 results for “llm”

OpenAI-compatible proxy that aggregates free-tier keys from ~14 AI providers with automatic failover. For personal experimentation only.

3.7k513MIT

100

#21

Doorman11991/smallcodeJavaScript

AI coding agent optimized for small LLMs. 87% benchmark with 4B-active model.

96766MIT

#35

rtk-ai/rtkRust

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

52.7k3.2kApache-2.0

#37

TauricResearch/TradingAgentsPython

TradingAgents: Multi-Agents LLM Financial Trading Framework

78.5k15.3kApache-2.0

#49

earendil-works/piTypeScript

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

52.8k6.3kMIT

#68

ggml-org/llama.cppC++

LLM inference in C/C++

112.2k18.6kMIT

#82

ZhuLinsen/daily_stock_analysisPython

LLM驱动的 A/H/美股智能分析：多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送，零成本定时运行，纯白嫖. LLM-powered stock analysis system for A/H/US markets.

38.5k37.1kMIT

#105

nashsu/llm_wikiTypeScript

LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratch every time), the LLM incrementally builds and maintains a persistent wiki from your sources。

8.9k1.1k

#108

QuantumNous/new-apiGo

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

34.9k7.8kAGPL-3.0

#131

opendatalab/MinerUPython

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

64.5k5.4k

#133

LearningCircuit/local-deep-researchPython

~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & Encrypted.

7.9k681MIT

#144

jundot/omlxPython

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

14.9k1.3kApache-2.0

#165

BerriAI/litellmPython

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

47.9k8.3k

#194

agentscope-ai/agentscope-javaJava

AgentScope Java: Agent-Oriented Programming for Building LLM Applications

3.2k685

#207

vllm-project/vllmPython

A high-throughput and memory-efficient inference and serving engine for LLMs

80.7k17.1kApache-2.0

#209

wanshuiyin/Auto-claude-code-research-in-sleepPython

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works with Claude Code, Codex, OpenClaw, or any LLM agent.

10.4k986MIT

#218

yamadashy/repomixTypeScript

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

25.3k1.3kMIT

#220

Shubhamsaboo/awesome-llm-appsPython

100+ AI Agent & RAG apps you can actually run — clone, customize, ship.

111.4k16.5kApache-2.0

#232

llm-d/llm-dShell

Achieve state of the art inference performance with modern accelerators on Kubernetes

3.2k488Apache-2.0

#247

jina-ai/readerTypeScript

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

10.9k813Apache-2.0

#255

lemonade-sdk/lemonadeC++

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

4.1k314Apache-2.0

#259

confident-ai/deepevalPython

The LLM Evaluation Framework

15.6k1.5kApache-2.0

#261

rocketride-org/rocketride-serverC++

High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.

3.1k905MIT

#271

infiniflow/ragflowPython

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

81.0k9.3kApache-2.0

#274

rasbt/LLMs-from-scratchJupyter Notebook

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

95.4k14.6k

#286

browser-use/browser-harnessPython

Browser Harness | Self-healing harness that enables LLMs to complete any task.

13.5k1.2kMIT

#294

AstrBotDevs/AstrBotPython

AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

32.8k2.3kAGPL-3.0

#306

upstash/context7TypeScript

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

55.9k2.6kMIT

#324

langfuse/langfuseTypeScript

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

27.7k2.8k

#325

SillyTavern/SillyTavernJavaScript

LLM Frontend for Power Users.

28.1k5.4kAGPL-3.0

#333

mostlygeek/llama-swapGo

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

4.2k323MIT

#336

jax-ml/scaling-bookHTML

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

999141MIT

#341

unclecode/crawl4aiPython

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

66.1k6.8kApache-2.0

#358

looplj/axonhubGo

⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.

3.9k477

#373

jingyaogong/minimindPython

🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!

50.4k6.4kApache-2.0

#407

PaddlePaddle/PaddleOCRPython

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

78.3k10.5kApache-2.0

#423

AgriciDaniel/claude-obsidianPython

Claude + Obsidian knowledge companion. Persistent, compounding wiki vault based on Karpathy's LLM Wiki pattern. /wiki /save /autoresearch

5.3k608MIT

#436

aaif-goose/gooseRust

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

45.7k4.7kApache-2.0

#444

Mintplex-Labs/anything-llmJavaScript

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

60.5k6.5kMIT

#452

CherryHQ/cherry-studioTypeScript

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

46.1k4.4kAGPL-3.0

#472

MemoriLabs/MemoriPython

Memori is agent-native memory infrastructure. A LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.

14.7k2.3k

#493

Tencent/WeKnoraGo

Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.

15.4k1.9k

#495

cheahjs/free-llm-api-resourcesPython

A list of free LLM inference resources accessible via API.

22.0k2.2k

#505

NVIDIA/Model-OptimizerPython

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

2.7k404Apache-2.0

#507

harry0703/MoneyPrinterTurboPython

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

57.6k8.3kMIT

#543

AlexsJones/llmfitRust

Hundreds of models & providers. One command to find what runs on your hardware.

26.5k1.6kMIT

#694

onyx-dot-app/onyxPython

Open Source AI Platform - AI Chat with advanced features that works with every LLM

29.6k4.0k

#703

skyzh/tiny-llmPython

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

4.2k318Apache-2.0

#724

hiyouga/LlamaFactoryPython

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

71.5k8.7kApache-2.0

#810

vllm-project/llm-compressorPython

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

3.3k519Apache-2.0

#822

NVIDIA/TensorRT-LLMPython

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13.7k2.4k

#831

xorbitsai/inferencePython

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

9.3k826Apache-2.0

#862

vllm-project/speculatorsPython

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

45389Apache-2.0

#864

cloudwego/einoGo

The ultimate LLM/AI application development framework in Go.

11.4k927Apache-2.0

#898

mnfst/awesome-free-llm-apisJavaScript

List of Permanent Free LLM API (API Keys)

4.5k429CC0-1.0

#901

FunAudioLLM/CosyVoicePython

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

21.2k2.4kApache-2.0

#916

ThinkWatchProject/ThinkWatchRust

Enterprise AI bastion host for secure AI API and MCP access, with unified proxying, RBAC, audit logs, rate limiting, and cost tracking across OpenAI, Anthropic, Gemini, and self-hosted LLMs.

97619

#956

angelos-p/llm-from-scratch

2.9k269

#966

mudler/LocalAIGo

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

46.4k4.1kMIT

#982

BerriAI/litellm-agent-platformTypeScript

Self-hosted platform for running coding agents (Claude Code, Codex, Hermes) in isolated sandboxes with vault proxy.

47848MIT

#983

vllm-project/vllm-omniPython

A framework for efficient model inference with omni-modality models

4.9k972Apache-2.0

#987

modelscope/ms-swiftPython

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

14.2k1.4kApache-2.0

#992

mlflow/mlflowPython

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

26.1k5.8kApache-2.0

#1032

rikkahub/rikkahubKotlin

RikkaHub is an Android APP that supports for multiple LLM providers.

4.7k311

#1109

langchain4j/langchain4jJava

LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing tool calling (including MCP support), agents and RAG easy. It integrates seamlessly with enterprise Java frameworks like Quarkus and Spring Boot.

12.1k2.3kApache-2.0

#1116

justcallmekoko/ESP32MarauderC++

A suite of WiFi/Bluetooth offensive and defensive tools for the ESP32

10.9k1.3k

#1122

botpress/botpressTypeScript

The open-source hub to build & deploy GPT/LLM Agents ⚡️

14.7k2.3kMIT

#1143

apify/crawleeTypeScript

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

23.4k1.4kApache-2.0

#1151

yasasbanukaofficial/claude-codeTypeScript

🚀 Open source Claude Code CLI source code. Advanced AI Agent for developers. Includes TypeScript codebase for LLM tool-calling, agentic workflows, and terminal UI. Remember this is just the skeleton not the brain itself. Found by Chaofan Shou.

3.3k4.6k

#1154

vllm-project/recipesJavaScript

Common recipes to run vLLM

805277Apache-2.0

#1161

OpenBMB/MiniCPM-VPython

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

25.2k2.0kApache-2.0

#1174

Tencent/AI-Infra-GuardPython

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

3.8k371Apache-2.0

#1235

labring/FastGPTTypeScript

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

28.1k7.1k

#1258

mlc-ai/mlc-llmPython

Universal LLM Deployment Engine with ML Compilation

22.7k2.0kApache-2.0

#1290

toon-format/toonTypeScript

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

24.3k1.1kMIT

#1330

JetBrains/koogKotlin

Koog is a JVM (Java and Kotlin) framework for building predictable, fault-tolerant and enterprise-ready AI agents across all platforms – from backend services to Android and iOS, JVM, and even in-browser environments. Koog is based on our AI products expertise and provides proven solutions for complex LLM and AI problems

4.2k407Apache-2.0

#1343

harnessclaw/harnessclaw-engineGo

An LLM programming assistant engine built with Go, supporting WebSocket, multi-turn dialogues, tool calling, permission control, and skill extension.

420132Apache-2.0

#1441

the-open-agent/openagentGo

⚡️next-generation personal AI assistant powered by LLM, RAG and agent loops, supporting computer-use, browser-use and coding agent, demo: https://demo.openagentai.org

4.9k566Apache-2.0

#1505

getsentry/sentry-mcpTypeScript

An MCP server for interacting with Sentry via LLMs.

694107

#1550

vllm-project/semantic-routerGo

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

4.2k683Apache-2.0

#1572

Kaden-Schutt/hipfireRust

RDNA-native LLM inference engine in Rust.

38845

#1585

harlan-zw/mdreamTypeScript

☁️ The fastest HTML to markdown convertor on GitHub. Optimized for LLMs and supports streaming.

89059MIT

#1721

ENTERPILOT/GoModelGo

AI gateway written in Go. Lightweight unified OpenAI-compatible API for OpenAI, Anthropic, Gemini, Groq, xAI & Ollama. LiteLLM alternative with observability, guardrails, streaming, costs and usage tracking.

88960MIT

#1765

777genius/agent-teams-aiTypeScript

You're the CTO, agents are your team. They handle tasks on their own, message each other, and review each other's work. You just watch the kanban board and give high-level commands. Codex/Claude/OpenCode(200+ models, 75+ LLM providers, free models no auth). Build your AI company.

966211AGPL-3.0

#1776

ConardLi/easy-datasetJavaScript

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

14.3k1.4k

#1791

FalkorDB/FalkorDBC

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

4.4k362

#1792

oobabooga/textgenPython

Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.

47.2k6.0kAGPL-3.0

#1839

thedaviddias/llms-txt-hubTypeScript

🤖 The largest directory for AI-ready documentation and tools implementing the proposed llms.txt standard

843460

#1876

nvk/llm-wikiShell

LLM-compiled knowledge bases for any AI agent. Parallel multi-agent research, thesis-driven investigation, source ingestion, wiki compilation, querying, and artifact generation.

45461MIT

#1886

opensolon/solon-aiJava

Java AI application development framework (supports LLM-tool,skill; RAG; MCP; Agent-ReAct,Team-Agent). Compatible with java8 ~ java25. It can also be embedded in SpringBoot, jFinal, Vert.x, Quarkus, and other frameworks.

38758Apache-2.0

#1891

0xPlaygrounds/rigRust

⚙️🦀 Build modular and scalable LLM Applications in Rust

7.4k822MIT

#1892

foldl/chatllm.cppC++

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

88069MIT

#1897

ModelTC/LightLLMPython

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

4.1k329Apache-2.0

#1961

Yeti-791/Tsec-HackathonPython

腾讯云智能渗透黑客松 Official repository of Tencent Cloud Intelligent Penetration Hackathon. Showcasing top open-source projects of LLM-based autonomous penetration agents, including multi-agent collaboration, automated penetration, AI-driven offensive security, and intelligent attack-defense solutions.

49258

#1966

pydantic/logfirePython

AI observability platform for production LLM and agent systems.

4.3k237MIT

#2004

Haozhe-Xing/agent_learningHTML

A systematic AI Agent development tutorial covering LLM agents, RAG, tool use, memory systems, multi-agent systems, LangChain, LangGraph, MCP, and agentic RL.｜从零开始学 AI Agent 开发 | 系统、全面、实战导向的 Agent 开发教程 | 每日自动追踪 arXiv 最新论文 | Learn AI Agent Development from Scratch

17925MIT

#2065