njhill - Overview Skip to content Navigation Menu Pricing Provide feedback Saved searches Use saved searches to filter your results more quickly Sign up Appearance settings Pinned Loading A high-throughput and memory-efficient inference and serving engine for LLMs Python 83.4k 18.3k IBM development fork of https://github.com/huggingface/text-generation-inference Python 65 35 Alternative etcd3 java client Java 163 42 Distributed Model Serving Framework Java 188 79 Netty project - an event-driven asynchronous network application framework Java 35k 16.3k Abstracted helper classes providing consistent key-value store functionality, with zookeeper and etcd3 implementations Java 6 3