San Francisco, California, United States
2K followers 500+ connections

Join to view profile

About

ML/Systems/Robotics Engineer, 5+ years of experience, with educational background in…

Activity

2K followers

See all activities

Experience & Education

  • Anyscale

View Alexandr’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Publications

  • Invertible Memory Flow Networks

    Long sequence neural memory remains a challenging problem. RNNs and their variants suffer from vanishing gradients, and Transformers suffer from quadratic scaling. Furthermore, compressing long sequences into a finite fixed representation remains an intractable problem due to the difficult optimization landscape. Invertible Memory Flow Networks (IMFN) make long sequence compression tractable through factorization: instead of learning end-to-end compression, we decompose the problem into…

    Long sequence neural memory remains a challenging problem. RNNs and their variants suffer from vanishing gradients, and Transformers suffer from quadratic scaling. Furthermore, compressing long sequences into a finite fixed representation remains an intractable problem due to the difficult optimization landscape. Invertible Memory Flow Networks (IMFN) make long sequence compression tractable through factorization: instead of learning end-to-end compression, we decompose the problem into pairwise merges using a binary tree of "sweeper" modules. Rather than learning to compress long sequences, each sweeper learns a much simpler 2-to-1 compression task, achieving O(log N) depth with sublinear error accumulation in sequence length. For online inference, we distilled into a constant-cost recurrent student achieving O(1) sequential steps. Empirical results validate IMFN on long MNIST sequences and UCF-101 videos, demonstrating compression of high-dimensional data over long sequences.

    Other authors
    See publication
  • Parent-Guided Semantic Reward Model (PGSRM): Embedding-Based Reward Functions for Reinforcement Learning of Transformer Language Models

    We introduce the Parent-Guided Semantic Reward Model (PGSRM), a lightweight reward framework for reinforcement learning (RL) of transformer language models. PGSRM replaces binary correctness signals, human preference data, and trained reward models with a simple signal: cosine similarity between a parent model's reference output embedding and a child model's generated output for the same input. This yields a dense, semantically meaningful reward with no human annotation or additional model…

    We introduce the Parent-Guided Semantic Reward Model (PGSRM), a lightweight reward framework for reinforcement learning (RL) of transformer language models. PGSRM replaces binary correctness signals, human preference data, and trained reward models with a simple signal: cosine similarity between a parent model's reference output embedding and a child model's generated output for the same input. This yields a dense, semantically meaningful reward with no human annotation or additional model training. We apply PGSRM on five language tasks and find that it produces smoother reward improvement and more stable PPO dynamics than a binary reward baseline, suggesting that embedding-based semantic rewards are a practical alternative to RLHF-style reward modeling for parent-guided alignment in smaller transformer models.

    See publication

Projects

  • Project Cerebrus: EEG-controlled Humanoid Robot

    -

    Project Cerebrus is an EEG-controlled robotics project that connects a PLUX NeuroBIT EEG hardware kit to a DIY humanoid robot. The system streams live data from the EEG hardware, classifies the signal into movement commands using subtractive convolutional layers, sends those commands through a websocket relay server, and executes movement sequences on the robot.

  • C++ Deep Learning Optimization Engine

    -

    A small C++ project, for a minimal deep learning optimization engine from scratch. The project includes a minimal Tensor class with automatic differentiation, matrix/vector operations, activation functions, softmax, cross-entropy-style loss construction, and SGD parameter updates.

  • TinyGPU

    -

    TinyGPU is a compact SystemVerilog matrix-vector accelerator built around a clean streaming architecture: a fetch_engine issues memory reads and pushes data through a skid-buffered valid/ready stream into a matrix_core that loads weights, accumulates results, and flushes an output vector. The top module (gpu_top) ties host control, memory interface, and result streaming together, exposing a simple programming model (set base addresses → RUN) with a unified busy status.

  • Project Benji: Autonomous Navigational/Conversational Robot

    -

    Autonomous patrol + conversational robot, using a self-hosted GPT language model, Whisper STT, and ElevenLabs TTS. Implements SLAM-style navigation with ultrasonic sensors, multi-user identity memory, agentic movement/vision commands, and a custom Q-table RL algorithm with sentiment analysis to adapt Benji’s behavior per user.

  • Learning Rate - AutoML Platform

    -

    Learning Rate is an AutoML platform to simplify machine learning for non-experts, allowing users to upload their own data and quickly generate a vast array of ML models.
    Tech Stack: Next.js for front-end development, Flask & Sci-Kit Learn for backend and machine learning, and GCP as data lake for model & data storage.

  • Reinforcement Learning Algorithms - DQN, PPO, Q-Learning

    -

    Utilized DQN algorithm for learning Q-values from raw state observations, achieving a trained model capable of navigating the Frozen Lake environment.
    Implemented PPO algorithm to optimize policy updates, resulting in a trained model proficient in controlling the Bipedal Walker environment.
    Used tabular Q-learning for optimal action selection, training a model capable of playing blackjack.

  • Blocky - NLP + Blockchain Data Marketplace

    -

    Blocky uses a fine-tuned bloom-3b model, as well as a 300-dimensional vector space model to match data seekers with providers, via a vector database architecture.
    Users configure AI agents, powered by the Gemini-Pro LM, to negotiate data transactions via the ETH main net.
    Tech Stack: Next.js, Firebase, Flask, Bloom-3b LM, FastText VS Model, Gemini-Pro LM, Solidity.

  • Sentinel - Social Media Fraud Detection

    -

    Developed Sentinel, an LLM to detect fraudulent activities for platforms/mediums such as YouTube, Twitter, Instagram, and SMS.
    Integrated Apache Spark for data processing, transformer architecture + BERT model for NLP, and Next.js for frontend design.

  • Prima - Fashion Trend Analysis Tool

    -

    Developed a Python-based fashion trend analysis tool, incorporating PostgreSQL for data management, Node.js for server-side operations, and Next.js for user interface design.
    Utilized Apache Airflow automated web scraping process, and SQL data wrangling to identify and analyze fashion trends.

  • Theseus - nanoGPT

    -

    Engineered a transformer-based language model to generate Greek mythology narratives, utilizing Python + PyTorch and a custom dataset.
    Configured model with multi-head attention, layer normalization, and feed-forward networks; optimized for CPU with a modified architecture and hyperparameters.

  • RePrice - Web Scraping Pricing/Ratings Comparison Tool

    -

    Developed a web app with Typescript and Python to scrape online data for monthly pricing and ratings comparisons.
    Uses AI algorithm to estimate monthly pricing plans of top competitors for any inputted product category.
    Stores product information in postreSQL database.

  • WebAnalytics - ETL data pipeline + custom KPI dashboard

    -

    Developed an automated web analytics tool leveraging GA4, GCP, and AppScript to streamline the collection and parsing of web analytics data.
    Features a custom dashboard for intuitive display and analysis of key web performance metrics, enhancing data-driven decision-making.

  • MarketDriven - Strategic Financial Planning Model

    -

    Developed a financial forecasting model using Excel to predict income statements, balance sheets, and cash flow statements up to seven years ahead.
    Incorporates user-inputted COGS, OPEX, and their projected growth, alongside marketing campaign data, estimated conversion rates, and churn rates.
    Outputs comprehensive financial forecasts to aid in strategic planning and decision-making.

Test Scores

  • SAT

    Score: 1540

    Score Breakdown:
    Math: 800, Reading: 740

  • SAT Math Level II Subject Test

    Score: 800

Languages

  • Russian

    Native or bilingual proficiency

  • English

    Native or bilingual proficiency

View Alexandr’s full profile

  • See who you know in common
  • Get introduced
  • Contact Alexandr directly
Join to view full profile

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses