A Datacenter Scale Distributed Inference Serving Framework
A scalable, distributed, collaborative, document-graph database, for the realtime web
zenoh unifies data in motion, data in use, data at rest, and computations. It carefully blends traditional pub/sub with geo-distributed storages, queries and computations, while retaining a level of time and space efficiency that is well beyond any of the mainstream stacks.
SeaweedFS is a distributed storage system for object storage (S3), file systems, and Iceberg tables, designed to handle billions of files with O(1) disk access and effortless horizontal scaling.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Tdarr - Distributed transcode automation using FFmpeg/HandBrake + Audio/Video library analytics + video health checking (Windows, macOS, Linux & Docker)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Cloud-Native distributed storage built on and for Kubernetes
Apache Kafka - A distributed event streaming platform
🔎 Open source distributed and RESTful search engine.
eBPF Observability - Distributed Tracing and Profiling
YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Simple, reliable, and efficient distributed task queue in Go
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
eBPF distributed networking observability tool for Kubernetes
A distributed, fast open-source graph database featuring horizontal scalability and high availability
A distributed task scheduling framework (XXL-JOB, a distributed task scheduling platform).
Distributed Task Queue (development branch)
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
CNCF Jaeger, a Distributed Tracing Platform
Free and Open Source, Distributed, RESTful Search Engine
Ceph is a distributed object, block, and file storage platform
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
I2P is an anonymizing network, offering a simple layer that identity-sensitive applications can use to securely communicate. All data is wrapped with several layers of encryption, and the network is both distributed and dynamic, with no trusted parties.
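PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)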
Apache Pulsar - distributed pub-sub messaging system
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.
Library for building distributed, real-time collaborative web applications
Dapr is a portable runtime for building distributed applications across cloud and edge, combining event-driven architecture with workflow orchestration.
Distributed lock for your scheduled tasks
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure without compromising performance.
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Apache Pinot - A realtime distributed OLAP datastore
Consul is a distributed, highly available, and data center aware solution to connect and configure applications across dynamic, distributed infrastructure.
Open source framework for building robust type-safe distributed systems with declarative infrastructure
🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.
JanusGraph: an open-source, distributed graph database
Premium Queue package for handling distributed jobs and messages in NodeJS.
Spring Cloud Alibaba provides a one-stop solution for application development for the distributed solutions of Alibaba middleware.
Distributed reliable key-value store for the most critical data of a distributed system
A Kubernetes operator for managing Garage - a distributed S3-compatible object storage system designed for self-hosting.
Enterprise-grade Distributed Asset & Vulnerability Scanner. Features: Port Scanning, Subdomain Brute-force, Fingerprinting, and PoC Detection. Built on Go-Zero & Vue3. High-performance distributed network asset scanning platform (subdomain scanning / port scanning / fingerprinting / weak-password brute-forcing / JS identification / PoC scanning)
PDK installer for open-source EDA tools and toolchains. Distributed with setups for the SkyWater 130nm and Global Foundries 180nm open processes.
A GraphQL execution engine for distributed schemas
Quick-to-deploy, distributed, multi-user (including multi-user sales), modern management panel for Minecraft and Steam game servers
Distributed platform for building autonomic network functions.
INDIGO is a system of standards and frameworks for multiplatform and distributed astronomy software development designed to scale with your needs.
An extensible distributed system for reliable nearline data streaming at scale
The complete load testing platform. Everything you need for production-grade load tests. Serverless & distributed. Load test with Playwright. Load test HTTP APIs, GraphQL, WebSocket, and more. Use any Node.js module.
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.
The official home of the Presto distributed SQL query engine for big data
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Git based distributed ticketing system, built on git-meta
Centralized network visibility and continuous asset discovery. Monitor devices, detect change, and stay aware across distributed networks.
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Distributed PostgreSQL as an extension
Distributed stream processing engine in Rust
Golem Cloud is the agent-native platform for building AI agents and distributed applications that never lose state, never duplicate work, and never require you to build infrastructure.
A music player that connects to your cloud/distributed storage.
Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way.
A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index building, search, and distributed online serving toolkits for large-scale vector search scenarios.
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
Tendis is a high-performance distributed storage system fully compatible with the Redis protocol.
M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
A toolchain for building scalable, enterprise-ready component systems on top of TypeScript and Web Component standards. Stencil components can be distributed natively to React, Angular, Vue, (+ more) and traditional web applications from a single, framework-agnostic codebase.
Work in progress distributed system that simplifies the orchestration of containers and virtual machines.
Distributed tracing without code changes. 🚀 Instantly monitor any application using OpenTelemetry and eBPF
🚀 PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
Next generation distributed, event-driven, parallel config management!
Zadig: An AI-powered, cloud-native, distributed DevOps platform designed for developers
High-performance distributed multi-tier cache system. Built in Rust.
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.
A distributed go-datastore implementation using Merkle-CRDTs.
Apache BifroMQ (Incubating) is a high-performance, distributed MQTT broker that natively supports multi-tenancy. It is designed to enable the building of large-scale IoT device connectivity and messaging systems.
The first distributed AGI system. Thousands of autonomous AI agents collaboratively train models, share experiments via P2P gossip, and push breakthroughs here. Fully peer-to-peer. Join from your browser or CLI.
MQTT Server/MQTT Broker - Scalable Distributed MQTT Message Broker for IoT in the 5G Era
Nornicdb is a distributed, low-latency Graph+Vector system with temporal MVCC, offering sub-ms HNSW search, graph traversal, and writes. It uses Neo4j Bolt/Cypher and Qdrant's gRPC, so you can switch with no changes. It also adds intelligent features such as schemas, managed embeddings, LLM reranking and inference, GPU acceleration, Auto-TLP, Memory Decay, and an MCP server.
A Ruby Gem to detect under what license a project is distributed.
A multithreaded, distributed, GPU-accelerated simulator of quantum computers
Native metadata service for distributed filesystems, object storage, and AI dataset workloads.
A purely functional library for building distributed and event-driven systems.
A high-performance distributed in-memory object cache system built from scratch in C++17, compatible with Redis protocol.
A distributed framework for LLM agents
Agent skills by mizchi, distributed via APM
Distributed High-Performance Symbolic Regression in Julia
Durable, Distributed runtime for ALL of your agents - OpenAI, ADK, Langchain, Vercel, etc.
Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrastructure for AI development at scale.
MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.
Spacedrive is an open source cross-platform file explorer, powered by a virtual distributed filesystem written in Rust.
FoundationDB - the open source, distributed, transactional key-value store