Search

Find trending repositories by name or description.

100 results for “llms”

#19

Doorman11991/smallcodeJavaScript

AI coding agent optimized for small LLMs. 87% benchmark with 4B-active model.

96766MIT

100

#107

QuantumNous/new-apiGo

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

34.8k7.8kAGPL-3.0

#131

LearningCircuit/local-deep-researchPython

~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & Encrypted.

7.9k681MIT

#201

vllm-project/vllmPython

A high-throughput and memory-efficient inference and serving engine for LLMs

80.7k17.1kApache-2.0

#212

yamadashy/repomixTypeScript

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

25.3k1.3kMIT

#251

lemonade-sdk/lemonadeC++

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

4.1k314Apache-2.0

#266

infiniflow/ragflowPython

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

81.0k9.3kApache-2.0

#269

rasbt/LLMs-from-scratchJupyter Notebook

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

95.4k14.6k

#281

browser-use/browser-harnessPython

Browser Harness | Self-healing harness that enables LLMs to complete any task.

13.5k1.2kMIT

#289

AstrBotDevs/AstrBotPython

AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

32.8k2.3kAGPL-3.0

#300

upstash/context7TypeScript

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

55.9k2.6kMIT

#329

jax-ml/scaling-bookHTML

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

999141MIT

#352

looplj/axonhubGo

⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.

3.9k475

#400

PaddlePaddle/PaddleOCRPython

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

78.3k10.5kApache-2.0

#444

CherryHQ/cherry-studioTypeScript

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

46.1k4.4kAGPL-3.0

#713

hiyouga/LlamaFactoryPython

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

71.5k8.7kApache-2.0

#796

vllm-project/llm-compressorPython

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

3.3k519Apache-2.0

#808

NVIDIA/TensorRT-LLMPython

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13.7k2.4k

#900

ThinkWatchProject/ThinkWatchRust

Enterprise AI bastion host for secure AI API and MCP access, with unified proxying, RBAC, audit logs, rate limiting, and cost tracking across OpenAI, Anthropic, Gemini, and self-hosted LLMs.

97119

#946

mudler/LocalAIGo

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

46.4k4.1kMIT

#966

modelscope/ms-swiftPython

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

14.2k1.4kApache-2.0

#971

mlflow/mlflowPython

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

26.1k5.8kApache-2.0

#1117

apify/crawleeTypeScript

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

23.4k1.4kApache-2.0

#1202

labring/FastGPTTypeScript

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

28.1k7.1k

#1468

getsentry/sentry-mcpTypeScript

An MCP server for interacting with Sentry via LLMs.

694107

#1546

harlan-zw/mdreamTypeScript

☁️ The fastest HTML to markdown convertor on GitHub. Optimized for LLMs and supports streaming.

89059MIT

#1751

oobabooga/textgenPython

Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.

47.2k6.0kAGPL-3.0

#1795

thedaviddias/llms-txt-hubTypeScript

🤖 The largest directory for AI-ready documentation and tools implementing the proposed llms.txt standard

843460

#2015

alibaba/MNNC++

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

15.2k2.3kApache-2.0

#2060

Kwwwww74/Awesome-Trustworthy-AudioLLMs

A reading list for trustworthy audio large language models.

1967

#2133

mozilla-ai/llamafileC++

Distribute and run LLMs with a single file.

24.5k1.4k

#2307

MindWorkAI/AI-StudioC#

MindWork AI Studio is a free, independent cross-platform desktop app for local and cloud LLMs across providers, built to democratize AI access.

48450

#2529

xming521/WeClonePython

🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life.

17.9k1.5kAGPL-3.0

#2653

chorus-codes/chorusTypeScript

Multi-LLM peer review for code decisions. Bring your own CLI; Chorus convenes 2-4 other LLMs to review the work before you ship.

48249Apache-2.0

#2762

google/langextractPython

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

36.5k2.5kApache-2.0

#2883

ludwig-ai/ludwigPython

Low-code framework for building custom LLMs, neural networks, and other AI models

11.7k1.2kApache-2.0

#2990

tetherto/qvacJavaScript

QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.

18657Apache-2.0

#3516

theJayTea/WritingToolsSwift

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

2.3k141GPL-3.0

#3529

LazyAGI/LazyLLMPython

Easiest and laziest way for building multi-agent LLMs applications.

3.8k389Apache-2.0

#3576

microsoft/LMOpsPython

General technology for enabling AI capabilities w/ LLMs and MLLMs

4.4k373MIT

#3925

LukasNiessen/terrashark

Terraform Skill for Claude Code and Codex. LLMs hallucinate a lot with Terraform - TerraShark fixes this. It eliminates hallucinations, is designed for modular and secure code and grounds your IaC in the official Hashicorp Terraform best practices.

45219MIT

#4123

ggml-org/LlamaBarnSwift

A cosy home for your LLMs.

1.3k70MIT

#4221

PaddlePaddle/FastDeployPython

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

3.7k744Apache-2.0

#4733

browser-use/browser-harness-jsTypeScript

Self-healing browser harness that enables LLMs to complete any task

45527MIT

#4785

tlysanhuo/aminer-daily-paperPython

Personalized paper recommendation for OpenClaw / Feishu, powered by AMiner + arXiv + LLMs.

4970MIT

#5050

aliyun/qwen-dianjinPython

Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud（通义点金：阿里云金融大模型）

45340

#5106

Tanq16/ai-contextGo

CLI tool to produce MD context files from many sources, to help interact with LLMs (ChatGPT, Llama3, Claude, etc.).

17313MIT

#5134

OpenDCAI/DataFlowPython

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

3.9k405Apache-2.0

#5231

ad-si/caiRust

User friendly CLI tool for AI tasks. Stop thinking about LLMs and prompts, start getting results!

19912ISC

#5248

rllm-team/rllmPython

Pytorch Library for Relational Table Learning with LLMs.

44034MIT

#5283

expectedparrot/edslPython

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

45973MIT

#5307

KudoAI/googlegptJavaScript

🤖 AI chat & search summaries in Google Search, powered by the latest LLMs

18417

#5397

kimtth/awesome-azure-openai-llmPython

A curated collection of resources for 🌌 Azure OpenAI, 🦙 LLMs (+RAG, Agents). Monthly Updates.

40155

#7093

mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

79.3k9.2kApache-2.0

#7097

nomic-ai/gpt4allC++

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

77.4k8.3kMIT

#7344

ParisNeo/lollms-webuiPython

Lord of Large Language and Multi modal Systems Web User Interface

4.8k584Apache-2.0

#7518

InternLM/lmdeployPython

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

7.9k697Apache-2.0

#7591

mixelpixx/KiCAD-MCP-ServerPython

KiCAD MCP is a Model Context Protocol (MCP) implementation that enables Large Language Models (LLMs) like Claude to directly interact with KiCAD for printed circuit board design.

996171MIT

#7644

StartupHakk/OpenMonoAgent.aiC#

(BETA) AI shouldn't have a meter. Unlimited tokens. Forever. Your machine. Your agent. Use it from anywhere. Terminal-native coding agent powered by local LLMs — 100% open source, free forever, and installed with a single command. Proudly built on C#/.NET, because AI tooling should be infrastructure, not a subscription.

967108

#7668

luhengshiwo/LLMForEverybodyJupyter Notebook

每个人都能看懂的大模型知识分享，LLMs春/秋招大模型面试前必看，让你和面试官侃侃而谈

6.5k606Apache-2.0

#7965

stanford-crfm/helmPython

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

2.8k385Apache-2.0

#8015

mangiucugna/json_repairPython

Repair malformed JSON from LLMs, APIs, logs, and user input in Python.

4.9k195MIT

#8136

taishi-i/awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese. Includes a Claude Code plugin with skills to search resources.

95944CC0-1.0

#8195

Portkey-AI/gatewayTypeScript

A blazing fast AI Gateway with integrated guardrails. Route to 1,600+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

11.8k1.1kMIT

#8245

EricLBuehler/candle-vllmRust

Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.

65580MIT

#8339

IAAR-Shanghai/Awesome-AI-MemoryPython

Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory 是一个集中式、持续更新的 AI 记忆知识库，系统性整理了与大模型记忆（LLM Memory）与智能体记忆（Agent Memory）相关的前沿研究、工程框架、系统设计、评测基准与真实应用实践。

86578Apache-2.0

#8863

AmberLJC/LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

2.0k101

#8921

modelpack/model-specGo

An Open Standard for Packaging, Distributing and Running LLMs in Cloud-Native Environments

19935Apache-2.0

#8982

NovaSky-AI/SkyRLPython

SkyRL: A Modular Full-stack RL Library for LLMs

1.9k322Apache-2.0

#9033

NVIDIA-NeMo/AutomodelPython

🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

500148Apache-2.0

#9274

zh-plus/openlrcPython

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT，Claude等)来转录、翻译你的音频为字幕文件。

65450MIT

#9334

oil-oil/wolfchaTypeScript

AI-powered Werewolf (Mafia) social deduction game where every player is controlled by top LLMs like DeepSeek, Qwen, Gemini, and more

59646Apache-2.0

#9551

pipecat-ai/gradient-bangTypeScript

Gradient Bang is an online multiplayer universe where you explore, trade, battle, and collaborate with other players and with LLMs

OO for LLMs

nobodywho-ooo/nobodywhoRust

NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.

93264EUPL-1.2

#10365

open-lm-engine/lm-enginePython

LM engine is a library for pretraining/finetuning LLMs

17129Apache-2.0

#10505

pettingllms-ai/PettingLLMsPython

[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Multi-Agent Systems

16727MIT

#10548

EgoAlpha/prompt-in-context-learningJupyter Notebook

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

2.2k193MIT

#10592

noonghunna/club-3090Python

Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1× and 2× cards.

99749Apache-2.0

#10656

openai/evalsPython

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

18.5k3.0k

#10818

567-labs/instructorPython

structured outputs for llms

13.0k1.0kMIT

#11239

MervinPraison/PraisonAIPython

PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous self-improving agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, RAG, and support for 100+ LLMs.

7.8k1.2kMIT

#11441

huhusmang/Awesome-LLMs-for-Vulnerability-DetectionPython

Awesome Large Language Models for Vulnerability Detection

81060

#11487

trvon/yamsC++

Persistent memory for LLMs and apps. Content-addressed storage with dedupe, compression, full-text and vector search.

3739GPL-3.0

#11511

TeleAI-UAGI/Awesome-Agent-Memory

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

41834Apache-2.0

#11524

TracyWang95/DataInfra-RedactionEverythingTypeScript

DataInfra Series. Redact EVERYTHING with local llms and vlms.

17126

#11626

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.

73144

#11755

notebook-intelligence/notebook-intelligencePython

A JupyterLab extension supporting Claude Code, Copilot, Ollama, and OpenAI-compatible LLMs, with MCP, skills, plugins, and notebook agents.

29854GPL-3.0

#12026

qualcomm/nexa-sdkKotlin

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.

8.0k997Apache-2.0

#12038

kardolus/chatgpt-cliGo

ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports OpenAI, Azure, Perplexity, LLaMA, and more, with features like streaming, interactive chat, prompt files, image/audio I/O, MCP tool calls, and an experimental agent mode for safe, multi-step automation.

92361MIT

#12068

yifanfeng97/Hyper-ExtractPython

Transform unstructured text into structured knowledge with LLMs. Graphs, hypergraphs, and spatio-temporal extractions — with one command.

919104

#12488

milanglacier/minuet-ai.elEmacs Lisp

💃 Dance with LLM in Your Code. Minuet offers code completion as-you-type from popular LLMs including OpenAI, Gemini, Claude, Ollama, Llama.cpp, Codestral, and more.

35514GPL-3.0

#12514

beihaili/Get-Started-with-Web3JavaScript

Open-source bilingual AI-native Web3 curriculum: wallets, Bitcoin, Ethereum, DeFi, L2, DAO, smart accounts, llms.txt and MCP

61456MIT

#12574

RekklesNA/ProxmoxMCP-PlusPython

Use MCP and OpenAPI to safely control Proxmox VE VMs, LXCs, backups, and snapshots from LLMs and AI agents.

19351MIT

#13041

MuLabPKU/TransArchPython

Design hardware-friendly model architectures and migrate existing LLMs with minimal performance loss

47230

#13413

SethRobinson/aitools_clientC#

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

18222

#13620

local-inference-lab/rtx6kproPython

RTX 6000 Pro Wiki — Running Large LLMs (Qwen3.5-397B, Kimi-K2.5, GLM-5) on PCIe GPUs without NVLink

33724

#13908

ymcui/Chinese-LLaMA-AlpacaPython

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

18.9k1.9kApache-2.0

#14049

Jacobinwwey/obsidian-NotEMDTypeScript

A Easy way to create your own Knowledge-base! Notemd enhances your Obsidian workflow by integrating with various Large Language Models (LLMs) to process your notes, automatically generate wiki-links for key concepts, create corresponding concept notes, perform web research, and more.

1967MIT

100

#14186

hyperfield/ai-file-sorterC++

Cross-platform desktop application for content-aware file organization and renaming. Supports local and remote LLMs, preview-based workflows, and fully user-controlled changes.

991110AGPL-3.0