Pinned

  1. infinity Public

    Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali

    Python, 2.9k stars, 195 forks

  2. hf-hub-ctranslate2 Public

    Connecting Transformers on HuggingFace Hub with CTranslate2

    Python, 39 stars, 2 forks

  3. boschresearch/CNC_Machining Public archive

    Data set for process monitoring on CNC machines

    Jupyter Notebook, 142 stars, 47 forks

  4. embed Public

    A stable, fast and easy-to-use inference library with a focus on a sync-to-async API

    48 stars, 2 forks

  5. candle-flash-attn-v3 Public

    C++, 15 stars, 2 forks

  6. ai-dynamo/dynamo Public

    A Datacenter Scale Distributed Inference Serving Framework

    Rust, 7.4k stars, 1.3k forks