Trending
All collections

Machine learning

Models, frameworks, training infra

100 repositories

1
#12
rohitg00/ai-engineering-from-scratch avatar
rohitg00/ai-engineering-from-scratch

Learn it. Build it. Ship it for others.

11.1k
100
2
#73
Fincept-Corporation/FinceptTerminal avatar
Fincept-Corporation/FinceptTerminal

FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.

22.3k
31
3
#140
Comfy-Org/ComfyUI avatar
Comfy-Org/ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

114.0k
22
4
#147
roboflow/supervision avatar
roboflow/supervision

We write your reusable computer vision tools. 💜

39.5k
21
5
#201
vllm-project/vllm avatar
vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

80.7k
18
6
#203
wanshuiyin/Auto-claude-code-research-in-sleep avatar
wanshuiyin/Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works with Claude Code, Codex, OpenClaw, or any LLM agent.

10.3k
18
7
#257
rocketride-org/rocketride-server avatar
rocketride-org/rocketride-server

High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all from your IDE. Includes VS Code extension, TypeScript/Python SDKs, and Docker deployment.

3.1k
16
8
#269
rasbt/LLMs-from-scratch avatar
rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

95.4k
16
9
#326
OpenBMB/VoxCPM avatar
OpenBMB/VoxCPM

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

19.4k
15
10
#393
liyupi/ai-guide avatar
liyupi/ai-guide

程序员鱼皮的 AI 资源大全 + Vibe Coding 零基础教程,分享 OpenClaw 保姆级教程、大模型玩法(DeepSeek / GPT / Gemini / Claude)、最新 AI 资讯、Prompt 提示词大全、AI 知识百科(Agent Skills / RAG / MCP / A2A)、AI 编程教程(Harness Engineering)、AI 工具用法(Cursor / Claude Code / TRAE / Codex / Copilot)、AI 开发框架教程(Spring AI / LangChain)、AI 产品变现指南,帮你快速掌握 AI 技术,走在时代前沿。本项目为开源文档,已升级为鱼皮 AI 导航网站

14.3k
14
11
#413
remsky/Kokoro-FastAPI avatar
remsky/Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-stitching

4.9k
14
12
#432
OpenBB-finance/OpenBB avatar
OpenBB-finance/OpenBB

Financial data platform for analysts, quants and AI agents.

67.9k
14
13
#467
ultralytics/ultralytics avatar
ultralytics/ultralytics

Ultralytics YOLO 🚀

57.4k
13
14
#500
nautechsystems/nautilus_trader avatar
nautechsystems/nautilus_trader

Production-grade Rust-native trading engine with deterministic event-driven architecture

22.9k
13
15
#521
kyegomez/OpenMythos avatar
kyegomez/OpenMythos

A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.

13.3k
13
16
#540
huggingface/transformers avatar
huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

160.9k
13
17
#548
NVIDIA/physicsnemo avatar
NVIDIA/physicsnemo

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

2.8k
13
18
#559
enactic/openarm avatar
enactic/openarm

A fully open-source humanoid arm for physical AI research and deployment in contact-rich environments.

2.5k
12
19
#568
f/prompts.chat avatar
f/prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

162.7k
12
20
#577
akfamily/akshare avatar
akfamily/akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

19.6k
12
21
#582
NVIDIA/earth2studio avatar
NVIDIA/earth2studio

Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.

912
12
22
#584
huggingface/huggingface_hub avatar
huggingface/huggingface_hub

The official Python client for the Hugging Face Hub.

3.6k
12
23
#588
pytorch/pytorch avatar
pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

100.1k
12
24
#615
lakehq/sail avatar
lakehq/sail

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

2.6k
12
25
#668
Developer-Y/cs-video-courses avatar
Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

81.5k
12
26
#699
wkentaro/labelme avatar
wkentaro/labelme

Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.

15.9k
12
27
#762
lance-format/lance avatar
lance-format/lance

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

6.5k
12
28
#806
screenpipe/screenpipe avatar
screenpipe/screenpipe

YC (S26) | Give AI the ability to live your experience. Records everything you do, say, hear 24/7, local, private, secure

18.8k
12
29
#808
NVIDIA/TensorRT-LLM avatar
NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13.7k
12
30
#816
xorbitsai/inference avatar
xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

9.3k
12
31
#828
mayocream/koharu avatar
mayocream/koharu

ML-powered manga translator, written in Rust.

4.4k
11
32
#831
skypilot-org/skypilot avatar
skypilot-org/skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

10.0k
11
33
#888
qdrant/qdrant avatar
qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

31.5k
11
34
#919
optuna/optuna avatar
optuna/optuna

A hyperparameter optimization framework

14.2k
11
35
#931
apache/airflow avatar
apache/airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

45.5k
11
36
#962
vllm-project/vllm-omni avatar
vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

4.9k
11
37
#969
microsoft/ML-For-Beginners avatar
microsoft/ML-For-Beginners

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

85.9k
11
38
#971
mlflow/mlflow avatar
mlflow/mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

26.1k
11
39
#1000
srbhr/Resume-Matcher avatar
srbhr/Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.

27.1k
11
40
#1034
paperless-ngx/paperless-ngx avatar
paperless-ngx/paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

40.9k
11
41
#1055
apache/superset avatar
apache/superset

Apache Superset is a Data Visualization and Data Exploration Platform

72.9k
11
42
#1056
tesseract-ocr/tesseract avatar
tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

74.2k
11
43
#1068
microsoft/onnxruntime avatar
microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

20.6k
11
44
#1073
scikit-learn/scikit-learn avatar
scikit-learn/scikit-learn

scikit-learn: machine learning in Python

66.1k
11
45
#1085
modelscope/FunASR avatar
modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

16.2k
11
46
#1182
ray-project/kuberay avatar
ray-project/kuberay

A toolkit to run Ray applications on Kubernetes

2.5k
11
47
#1190
netdata/netdata avatar
netdata/netdata

The fastest path to AI-powered full stack observability, even for lean teams.

78.9k
11
48
#1204
deepspeedai/DeepSpeed avatar
deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

42.4k
11
49
#1317
google/xrblocks avatar
google/xrblocks

XR Blocks is a lightweight WebXR + AI library for rapidly prototyping advanced AI + XR experiences.

403
11
50
#1333
streamlit/streamlit avatar
streamlit/streamlit

Streamlit — A faster way to build and share data apps.

44.7k
11
51
#1368
ray-project/ray avatar
ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

42.6k
11
52
#1394
mcmonkeyprojects/SwarmUI avatar
mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

4.1k
11
53
#1397
marimo-team/marimo avatar
marimo-team/marimo

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

21.1k
11
54
#1403
growthbook/growthbook avatar
growthbook/growthbook

Open Source Feature Flags, Experimentation, and Product Analytics

7.8k
11
55
#1411
qubvel-org/segmentation_models.pytorch avatar
qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

11.6k
11
56
#1488
google-ai-edge/mediapipe avatar
google-ai-edge/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

35.3k
11
57
#1532
TheJudge26/C-Asterisk-Alpha avatar
TheJudge26/C-Asterisk-Alpha

C* (C-Asterisk) is a custom, high-performance programming language. It uses LLVM and native I/O bypasses to run Deep Learning models from scratch 100x faster than Python.

172
10
58
#1534
Kaden-Schutt/hipfire avatar
Kaden-Schutt/hipfire

RDNA-native LLM inference engine in Rust.

388
10
59
#1558
huggingface/peft avatar
huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

21.2k
10
60
#1562
dagucloud/dagu avatar
dagucloud/dagu

Local-first control plane for data scripts, SQL, containers, SSH commands, and AI agent workflows. One binary, file-backed state, no external database or broker. Use your favorite AI agents to create, update, and run your workflows via secure MCP with traceability and logging.

3.4k
10
61
#1564
Nixtla/statsforecast avatar
Nixtla/statsforecast

Lightning ⚡️ fast forecasting with statistical and econometric models.

4.8k
10
62
#1566
voxel51/fiftyone avatar
voxel51/fiftyone

Refine high-quality datasets and visual AI models

10.7k
10
63
#1615
tensorflow/tensorflow avatar
tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

195.2k
10
64
#1677
lutzroeder/netron avatar
lutzroeder/netron

Visualizer for neural network, deep learning and machine learning models

32.9k
10
65
#1690
ggml-org/ggml avatar
ggml-org/ggml

Tensor library for machine learning

14.7k
10
66
#1691
tracel-ai/burn avatar
tracel-ai/burn

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

15.2k
10
67
#1703
blakeblackshear/frigate avatar
blakeblackshear/frigate

NVR with realtime local object detection for IP cameras

32.1k
10
68
#1763
gradio-app/gradio avatar
gradio-app/gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

42.7k
10
69
#1813
Tencent/ncnn avatar
Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

23.3k
10
70
#1850
ndif-team/nnsight avatar
ndif-team/nnsight

The nnsight package enables interpreting and manipulating the internals of deep learned models.

932
10
71
#1851
ModelTC/LightLLM avatar
ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

4.1k
10
72
#1857
camel-ai/camel avatar
camel-ai/camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

17.0k
10
73
#1883
HumanSignal/label-studio avatar
HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

27.4k
10
74
#1934
PrefectHQ/prefect avatar
PrefectHQ/prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

22.4k
10
75
#1946
pandas-dev/pandas avatar
pandas-dev/pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

48.8k
10
76
#1967
SemiAnalysisAI/InferenceX avatar
SemiAnalysisAI/InferenceX

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

989
10
77
#1979
SwanHubX/SwanLab avatar
SwanHubX/SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / Ultralytics / MMEngine / Keras etc.

3.9k
10
78
#1995
lixus7/Time-Series-Works-Conferences avatar
lixus7/Time-Series-Works-Conferences

Time-Series Work Summary in CS Top Conferences (NIPS, ICML, ICLR, KDD, AAAI, WWW, IJCAI, CIKM, ICDM, ICDE, etc.)

960
10
79
#2014
kedro-org/kedro avatar
kedro-org/kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

10.9k
10
80
#2015
alibaba/MNN avatar
alibaba/MNN

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

15.2k
10
81
#2030
apache/tvm avatar
apache/tvm

Open Machine Learning Compiler Framework

13.4k
10
82
#2141
huggingface/diffusers avatar
huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

33.7k
10
83
#2151
torchgeo/torchgeo avatar
torchgeo/torchgeo

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

4.0k
10
84
#2247
interpretml/interpret avatar
interpretml/interpret

Fit interpretable models. Explain blackbox machine learning.

6.9k
10
85
#2260
matplotlib/matplotlib avatar
matplotlib/matplotlib

matplotlib: plotting with Python

22.8k
10
86
#2292
memvid/memvid avatar
memvid/memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

15.5k
10
87
#2318
harvard-edge/cs249r_book avatar
harvard-edge/cs249r_book

Machine Learning Systems

24.2k
10
88
#2328
reflex-dev/reflex avatar
reflex-dev/reflex

🕸️ Web apps in pure Python 🐍

28.4k
10
89
#2385
cvat-ai/cvat avatar
cvat-ai/cvat

Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.

15.9k
10
90
#2387
dagster-io/dagster avatar
dagster-io/dagster

An orchestration platform for the development, production, and observation of data assets.

15.6k
10
91
#2454
spiceai/spiceai avatar
spiceai/spiceai

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

2.9k
10
92
#2539
arc53/DocsGPT avatar
arc53/DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

17.9k
10
93
#2563
stas00/ml-engineering avatar
stas00/ml-engineering

Machine Learning Engineering Open Book

18.0k
10
94
#2566
huggingface/datasets avatar
huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

21.5k
10
95
#2593
NVIDIA/cutlass avatar
NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

9.8k
10
96
#2594
onnx/onnx avatar
onnx/onnx

Open standard for machine learning interoperability

20.9k
10
97
#2599
wandb/wandb avatar
wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

11.1k
10
98
#2606
KDE/labplot avatar
KDE/labplot

LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.

457
10
99
#2657
alvinreal/awesome-opensource-ai avatar
alvinreal/awesome-opensource-ai

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

3.6k
10
100
#2670
superiorlu/AITreasureBox avatar
superiorlu/AITreasureBox

🤖 Automatically collected AI repos, tools, websites, papers & tutorials. 实用AI百宝箱 💎

806
10