Zaloguj sięZarejestruj się
Anyscale
1,936 posts
user avatar
Anyscale
@anyscalecompute
The best way to run @raydistributed, the distributed compute framework for scaling AI workloads.
anyscale.com
Data dołączenia: September 2019
3
Obserwujesz
13K
Obserwujący

Dopiero zaczynasz korzystać z serwisu X?

Zarejestruj się teraz, żeby stworzyć własną, spersonalizowaną oś czasu!

Utwórz konto

Rejestrując się, akceptujesz Zasady użytkowania i Politykę prywatności, w tym Korzystanie z plików cookie.

Terms·Privacy·Cookies·Dostępność·Ads Info·© 2026 X Corp.
Don't miss what's happening
Użytkownicy serwisu X wiedzą jako pierwsi.
Zaloguj sięZarejestruj się
  • user avatar
    Anyscale
    @anyscalecompute
    13 gru 2023
    We’re excited to announce the official @MistralAI Mixtral 8x7B model on Anyscale Endpoints, offering the best price on the market with an OpenAI compatible API. 💸 Pricing: $0.5 / million tokens 📆 Coming soon: JSON mode and function calling Try out Mixtral on Anyscale
    547K
  • user avatar
    Anyscale
    @anyscalecompute
    21 gru 2023
    📈We’re excited to introduce the LLMPerf leaderboard: the first public and open source leaderboard for benchmarking performance of various LLM inference providers in the market. Our goal with this leaderboard is to equip users and developers with a clear understanding of the
    376K
  • user avatar
    Anyscale
    @anyscalecompute
    12 gru 2023
    We're announcing new features and models today. 🔵 JSON mode ⚫️ function calling Try them out with our API.
    Blog | Anyscale
    Z anyscale.com
    174K
  • user avatar
    Anyscale
    @anyscalecompute
    1 lip 2024
    1/ 🚀 Introducing RouteLLM: a routing framework based on human preference data for routing queries between powerful proprietary LLMs and cost-effective LLMs, developed in collaboration with @lmsysorg . By intelligently selecting the best model for each query, our router models
    20K
  • user avatar
    Anyscale
    @anyscalecompute
    16 lip 2020
    Anyscale is honored to have been recognized by Forbes as one of America’s Most Promising Artificial Intelligence Companies anyscale.com/blog/forbes-ai…
  • user avatar
    Anyscale
    @anyscalecompute
    10 lip 2024
    We’ve recently contributed FP8 support to the @vllm_project in collaboration with @neuralmagic. With this feature, you can see up to a 1.8x reduction in inter-token latency, with >99% accuracy preservation! 1/n
    34K
  • user avatar
    Anyscale
    @anyscalecompute
    22 gru 2023
    🔥 Mixtral-8x7B JSON Mode and Function Calling API is now available on Anyscale Endpoints! Empirically, we observed noticeable improvements in response to tool messages by Mixtral MoE, compared @MistralAI 7B. 🚀 👇 Try it out: app.endpoints.anyscale.com
    29K
  • user avatar
    Anyscale
    @anyscalecompute
    28 maj 2024
    Recently, we’ve contributed chunked prefill to @vllm_project, leading to up to 2x speedup for higher QPS regimes! In vLLM, prefilling, which fills the KV cache, and decoding, which outputs new tokens, can interfere with each other, resulting in latency degradation. 1/n
    33K
  • user avatar
    Anyscale
    @anyscalecompute
    7 gru 2021
    🎉 We’re thrilled to announce our $100M Series C led by @a16z & Addition + general availability of the Anyscale managed @raydistributed platform! forbes.com/sites/kenrickc… Both are big steps forward in our mission to accelerate the scaling and productionization of #AI apps.
  • user avatar
    Anyscale
    @anyscalecompute
    22 paź 2025
    Today we’re donating Ray to The Linux Foundation under the PyTorch Foundation with PyTorch + vLLM, strengthening the open compute fabric for AI. Ensures long-term neutrality, open governance, and ecosystem alignment. Blog: na2.hubs.ly/H01JydX0 Ray Summit (Nov 3–5, SF):
    22K
  • user avatar
    Anyscale
    @anyscalecompute
    16 sty 2024
    Producing ~1B embeddings can take weeks and cost tens of thousands of dollars ($60K with OpenAI in the example below). We are thrilled to partner with @pinecone on the launch of their new serverless offering! Anyscale + Pinecone reduce the cost of computing these embeddings by
    18K
  • user avatar
    Anyscale
    @anyscalecompute
    30 sty 2024
    Ready to hear from #RAG experts at @langchain @vercel @pinecone @anyscalecompute and get hands-on with intensive guided trainings? The 2-day RAG Developer Bootcamp is for you! Learn more & register now 👉hubs.ly/Q02j9QmW0 #llm #ml #rag #ai #vectordatabase #ray #pinecone
    57K
  • user avatar
    Anyscale
    @anyscalecompute
    18 kwi 2024
    🦙 We're excited to host @Meta Llama-3 8b and 70b on Anyscale Endpoints! ➕ Fine-tuning, JSON mode and function calling support coming soon as well! Pricing: - 8B: $0.15 / Million tokens - 70B: $1.00 / Million tokens
    10K
  • user avatar
    Anyscale
    @anyscalecompute
    31 maj 2023
    Our Eng team has been learning a lot about managing and scaling open source LLMs, using some great models from @huggingface . We packaged up what we've learned to help you build/deploy your own! Announcing Aviary: Open Source Multi-LLM Serving
    Blog | Anyscale
    Z anyscale.com
    12K