Anyscale

Software Development

San Francisco, California 60,970 followers

Scalable compute for AI and Python. Creators of Ray distributed compute framework.

See jobs Follow

Discover all 766 employees

About us

Anyscale enables Python developers to build and run all their AI—from data prep to training and inference—at any scale. Anyscale is trusted by leading AI teams at Canva, TripAdvisor, Physical Intelligence, Coinbase and more.

Website: https://anyscale.com
External link for Anyscale
Industry: Software Development
Company size: 201-500 employees
Headquarters: San Francisco, California
Type: Privately Held
Founded: 2019

Employees at Anyscale

Patrick Lonergan

LinkedIn Member
LinkedIn Member
LinkedIn Member
LinkedIn Member
LinkedIn Member

View 766 employees at Anyscale

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

See all employees

Locations

Primary

600 Harrison St

San Francisco, California 94107, US

Get directions
411 High St

Palo Alto, California 94301, US

Get directions

Updates

Anyscale

60,970 followers
12h
Report this post
As platform teams begin supporting agentic AI systems, many are discovering that the infrastructure built for cloud-native applications doesn't naturally extend to AI workloads. Kubernetes excels at scaling stateless services, but AI introduces fundamentally different workload patterns: ▪️Training needs distributed scheduling, fault tolerance, and fair GPU sharing. ▪️Inference demands low-latency serving, efficient GPU utilization, and cost-aware placement. ▪️Reinforcement learning loops combine data processing, training, simulation, and inference into a single continuous workflow. If simply trying to run containers with AI models on K8s, teams run into GPU contention, fragmented tooling, scheduling complexity, and infrastructure that wasn't designed for multiple AI workload types. The next evolution isn't replacing Kubernetes, it's extending it with AI-native workload orchestration, multi-workload support, and smarter GPU scheduling. Learn more from Christian Stano on how to address this with Ray on Anyscale from his session at PlatformCon 2026 from Platform Engineering: https://lnkd.in/gK_3aMRv
Like Comment Share
Anyscale reposted this
Marcell Ferencz
16h
Report this post
I got to present our team's work on how we scale vision AI inference on satellite imagery to continental scale using Ray on Anyscale. A big thanks to the Anyscale team for hosting us and for the invaluable technical guidance they gave us throughout. https://lnkd.in/e238HpYd Milos Colic Vuong Nguyen Pritimoy Podder Pablo Hidalgo Ryan Bashir Ali Sezer Alexandr Plashchinsky

How Adyen trains a Transaction Foundation Model (TFM) on 51 trillion tokens and other stories on scaling AI with Ray from Xoople, Criteo, and BMW | Anyscale anyscale.com

1 Comment

Like Comment Share
Anyscale

60,970 followers
1d
Report this post
Most GPU platforms are built for one user, one cluster, one workload at a time. At Geotab, GPU Docker images took 20 to 30 minutes to load, one researcher occupying a machine meant everyone else waited, and every new team needed its own Terraform setup just to get started. With Anyscale, they built a platform where data scientists can annotate a job with "GPU = 0.1", get exactly that fraction of a GPU, and run alongside a dozen other workloads simultaneously, all without touching Kubernetes. Image load times dropped to 4 to 5 minutes. GPU utilization improved 4x. And the platform team now supports a growing organization of researchers without growing the infrastructure burden alongside it. Full case study: https://lnkd.in/gftDrqb9
Like Comment Share
Anyscale reposted this
Robert Nishihara
1d
Report this post
Try Ray 2.56!

Richard Liaw

Anyscale
2d

🚀 Ray 2.56 just landed! The team has been doing a lot of work to reduce OOMs and unnecessary spilling in Ray Data pipelines, driven by improvements in Ray Data memory management, better defaults, better process management, and more. In our testing, we’ve seen: 📉 Batch inference pipelines go from 300+ OOMs in 2.55 to 0 in 2.56 ⚡ Training data pipelines with local shuffle improve throughput by 3x ⏱️ Ray Data scheduling loop latency reduce by 6x at 2,000-worker scale 🧹 Training pipelines that previously spilled over 70 GB in 2.55 drop down to zero spilling in 2.56 If you’ve run into Ray Data issues in the past, we encourage you to try Ray Data 2.56! Read more on the release blog: https://lnkd.in/gK6xuBwN

Ray Data 2.56: Improving Reliability for AI Data Pipelines | Anyscale anyscale.com

1 Comment

Like Comment Share
Anyscale reposted this
Richard Liaw
2d
Report this post
🚀 Ray 2.56 just landed! The team has been doing a lot of work to reduce OOMs and unnecessary spilling in Ray Data pipelines, driven by improvements in Ray Data memory management, better defaults, better process management, and more. In our testing, we’ve seen: 📉 Batch inference pipelines go from 300+ OOMs in 2.55 to 0 in 2.56 ⚡ Training data pipelines with local shuffle improve throughput by 3x ⏱️ Ray Data scheduling loop latency reduce by 6x at 2,000-worker scale 🧹 Training pipelines that previously spilled over 70 GB in 2.55 drop down to zero spilling in 2.56 If you’ve run into Ray Data issues in the past, we encourage you to try Ray Data 2.56! Read more on the release blog: https://lnkd.in/gK6xuBwN

Ray Data 2.56: Improving Reliability for AI Data Pipelines | Anyscale anyscale.com

Like Comment Share
Anyscale reposted this
Keerti Melkote
2d Edited
Report this post
Robert Nishihara says “Inference is a subroutine of larger more complex AI pipelines”. This is a very succinct way to understand what is happening in AI right now. AI projects are graduating from custom inference to custom models. The business imperative is shifting from simply lower costs to owning a moat. The moat is the data and the AI learning loop. Learning loops require complex orchestration of rollouts, data, evals, policy updates and more across a heterogeneous compute estate of GPUs and CPUs. Inference is a subroutine in this context. It’s still critical. But a part of a whole that is more complex. For this new era of AI, composability becomes a key aspect without giving up on performance. Ray is the backbone for this era with Ray Serve as the most ergonomic way for developers to compose model serving as a part of the AI learning loop. But that is not an excuse for lower performance. Performance still matters in this context. This is why we have focused on improving Ray Serve performance 4.4x for prefill and 28x for decode stages. We are excited for what this does to unify the disparate parts of the AI learning loop into a single cohesive AI backbone for all your varied workload needs. Read more about the performance optimizations in this blog: https://lnkd.in/gVdsg7cj Try it out in Ray 2.56 or easier still on Anyscale, and join us on the Ray Slack to share feedback!

High Performance Distributed Inference with Ray Serve LLM | Anyscale anyscale.com

Like Comment Share
Anyscale

60,970 followers
6d
Report this post
Evaluating a robot foundation model is one of the most demanding closed-loop problems in robotics. Before you can trust a policy on a real robot, you need to validate it across thousands of starting conditions, pairing GPU-heavy model inference with GPU-heavy physics simulation, step by step. At scale, evaluation quickly becomes an infrastructure challenge, not just a robotics problem. In this new blog, Ian D. Jordan, PhD explains how to run thousands of simulation rollouts in parallel, scaling from a single machine to distributed clusters with minimal code changes. Learn how robotics teams can maximize GPU utilization, reduce infrastructure overhead, and spend more time improving robot policies. Read the blog: https://lnkd.in/gHvqbEi5

Scale Robot Policy Evaluation with Ray | Anyscale anyscale.com

1 Comment

Like Comment Share
Anyscale reposted this
Mengliao(Mike) Wang
1w
Report this post
Anyscale just published a case study on our work at Geotab! We've been building our AI platform with Ray and Anyscale to run AI/ML inference at scale efficiently and without burning through GPUs. And the results are: - 43x peak-hour throughput - 4x GPU utilization - 40% fewer GPUs at peak Read the full case study here: https://lnkd.in/gPHGGgvn #Geotab #Ray #Anyscale #AIPlatform

Geotab Centralizes Fleet Video AI on Anyscale anyscale.com

1 Comment

Like Comment Share
Anyscale

60,970 followers
1w Edited
Report this post
Ray Summit kicks off with a full day of hands-on training on Aug 24. 🛠️ Built for engineers running AI in production, not a weekend hackathon or a deploy-your-first-model tutorial. Choose your own training: Select 1 AM track and 1 PM track. Morning → Multimodal data processing pipelines for AI systems → Foundation model distributed training with Ray → Production-ready distributed inference with Ray Serve Afternoon → Scaling physical AI & robotics systems with Ray → Real-time search & recommendation systems for AI commerce → LLM post-training and high-performance serving Passes with training are limited. Secure them now with early pricing at $250! https://lnkd.in/gF3FqYHS
1 Comment

Like Comment Share
Anyscale reposted this
Robert Nishihara
1w
Report this post
Fast LLM inference with Ray Serve + vLLM + GKE. https://lnkd.in/gMsuYSZR

Improving Ray Serve LLM on GKE throughput, latency | Google Cloud Blog cloud.google.com

3 Comments

Like Comment Share

Browse jobs

Funding

Anyscale 4 total rounds

Last Round

Series C Sep 23, 2022

US$ 99.0M

Investors

Intel Capital Addition + 2 Other investors

See more info on crunchbase

Anyscale

Software Development

San Francisco, California 60,970 followers

Scalable compute for AI and Python. Creators of Ray distributed compute framework.

About us

Employees at Anyscale

Patrick Lonergan

View 766 employees at Anyscale

Locations

Updates

Join now to see what you are missing

Similar pages

Databricks

Branch

Cyberhaven

Anthropic

WHOOP

Together AI

Scale AI

Perplexity

Envoy

Conviva

Browse jobs

Engineer jobs

Software Engineer jobs

Account Executive jobs

Senior Software Engineer jobs

Enterprise Account Executive jobs

Machine Learning Engineer jobs

Scientist jobs

Vice President jobs

Analyst jobs

Developer jobs

Director jobs

Manager jobs

Software Engineering Manager jobs

Senior Product Manager jobs

Staff Software Engineer jobs

Associate jobs

Engineering Manager jobs

Intelligence Specialist jobs

Product Manager jobs

Director of Analytics jobs

Funding