Log inSign up
Aran Komatsuzaki
6,779 posts
user avatar
Aran Komatsuzaki
@arankomatsuzaki
Sharing AI research. Early work on AI (GPT-J, scaling, MoE). Ex ML PhD (GT) & Google.
arankomatsuzaki.wordpress.com/about-me/
Joined November 2016
375
Following
181.6K
Followers

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Feb 8, 2023
    OpenAI did what used to be considered impossible. They made people want to use Bing.
    1.1M
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Feb 1, 2025
    529K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Jan 30, 2025
    The leap from o1 to o3 is exponential, completely bypassing o2. If this pattern holds, o3 won’t lead to o4—it’ll jump straight to o9.
    371K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    May 14, 2025
    Don't forget to check your DMs
    244K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Sep 9, 2025
    Google presents an AI system to write expert-level scientific software. Using LLMs + tree search, it invented novel methods in bioinformatics, epidemiology, geospatial analysis & more, often surpassing human SOTA. (1/4)
    535K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    May 31, 2021
    When you generate images with VQGAN + CLIP, the image quality dramatically improves if you add "unreal engine" to your prompt. People are now calling this "unreal engine trick" lol e.g. "the angel of air. unreal engine"
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Mar 31, 2023
    BloombergGPT: A Large Language Model for Finance Presents BloombergGPT, a 50 billion parameter language model that is trained on a wide range of financial data. arxiv.org/abs/2303.17564
    504K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    May 25, 2022
    Large Language Models are Zero-Shot Reasoners Simply adding “Let’s think step by step” before each answer increases the accuracy on MultiArith from 17.7% to 78.7% and GSM8K from 10.4% to 40.7% with GPT-3. arxiv.org/abs/2205.11916
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Mar 21, 2021
    We've released the weights (1.3B and 2.7B) of our replication of GPT-3 🥳 Using the updated Colab notebook in the repo you should be able to finetune the models on your own data as well as run inference.
    GitHub - EleutherAI/gpt-neo: An implementation of model parallel GPT-2 and GPT-3-style models using...
    From github.com
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Sep 17, 2025
    Pattern of my 20s: “This idea’s great, but others are better positioned. I’m late and lack domain expertise. I’ll find something new.” → Later: someone even less qualified makes it work.
    94K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Nov 30, 2023
    Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation proj: humanaigc.github.io/animate-anyone/ abs: arxiv.org/abs/2311.17117
    00:00
    772K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Feb 12, 2025
    OpenAI presents: Competitive Programming with Large Reasoning Models - Competed live at IOI 2024 - o3 achieved gold - General-purpose o3 surpasses o1 w/ hand-crafted pipelines specialized for coding resultss
    625K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Jan 31, 2025
    85K
  • user avatar
    Aran Komatsuzaki
    @arankomatsuzaki
    Feb 18, 2025
    If you think you have regrets, here are mine: - I turned down an early invitation from Noam to join CharacterAI—and similarly from Igor to join XAI. - I stumbled through a coding interview for OpenAI when Wojciech asked if I wanted to work on GPT-4. - I was once trying to
    220K