Li Li 李力

San Francisco Bay Area

Sign in to view Li’s full profile

Li can introduce you to 10+ people at Meta

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

4K followers 500+ connections

View mutual connections with Li

Li can introduce you to 10+ people at Meta

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Join to view profile

About

Li Li is a highly experienced professional with over 10 years of expertise in machine…

Activity

4K followers

Li Li 李力 reposted this
Report this post
Ying Guo

Ying Guo

1y

Li Li 李力 reposted this
Come join us to build cutting edge custom silicon that revolutionizes the AR technology! You will be involved in the full silicon development cycle from pre- to post- silicon. You will work on validating multiple complex SoCs with opportunities to contribute to tech stack across HW, FW, SW, runtime and UX. Let's build the AR vision together! https://lnkd.in/g_qDzpYQ

Danielle Tay Mendez

Danielle Tay Mendez

1y

Li Li 李力 reposted this
Hiring BSP/ Embedded Engineers (Taipei) to build systems software for Meta devices. Come join us to get your hands on building the world's most advanced AI/ AR products! https://lnkd.in/g5NutiFb https://lnkd.in/g54BnQ2c Apply directly or connect to learn more about the opportunity 😎 Pei Zheng Ying Guo

Embedded Software Engineer

Embedded Software Engineer
2 Comments
Li Li 李力

Li Li 李力

1y
Report this post
Li Li 李力 posted this
We are hiring an IC6 engineer for the instagram feed retrieval team. This is a fast growth product and team with a lot of growth opportunities. Please feel free to reach out to me if you are interested and in the process of team match at Meta.
1 Comment
Li Li 李力

Li Li 李力

1y
Report this post
Li Li 李力 shared this
Join us to build the future of Instagram Feed! Please feel free to DM me if you are interested!

Zhe Zhang

Zhe Zhang

1y

Li Li 李力 shared this
🚀 We’re Hiring: Staff Machine Learning Engineers – Instagram Feed Relevance Join our team to shape the future of the Instagram home feed! We’re looking for talented engineers to work on cutting-edge, end-to-end recommendation systems that power the content users see—whether it’s from accounts they follow or AI-recommended content. Our work spans all media types: photos, carousels, reels, and more. As part of this role, you’ll dive deep into Retrieval, Ranking, and Delivery, optimizing across the stack to drive impactful metrics like session visits and DAU. This is a unique opportunity to leverage state-of-the-art technologies to make a real difference for both Instagram and Meta. If you’re experienced in machine learning, recommendation systems, and passionate about improving user experience through ML-driven optimization, we want to hear from you. 📩 Reach out if you’re interested.
Li Li 李力

Li Li 李力

3y
Report this post
Li Li 李力 shared this

Alex Shcheglovitov 🇺🇦

Alex Shcheglovitov 🇺🇦

3y

Li Li 李力 shared this
Excited to share our latest research on human #stemcells derived telencephalic #brain #organoids and #autism gene SHANK3 published with Nature Portfolio in Nature Communications today! Read here: https://lnkd.in/gde7mRMn And here for lay audience by Julie Kiefer Kiefer: https://lnkd.in/gEPGsut2 - We generated human telencephalic organoids from stem cell-derived single neural rosettes. - We showed that single neural rosette-derived organoids contain pallial and subpallial neural progenitors, excitatory and inhibitory neurons, as well as macroglial and periendothelial cells, and exhibit predictable organization and cytoarchitecture. - We found that about 90% of neurons in organoids are functionally mature and fire action potentials - Finally, we demonstrated that neurons in organoids with a deletion of autism and phelan-mcdermid syndrome gene SHANK3 exhibit excitability deficits and abnormal expression of neuronal cell-adhesion molecules. People contributed to this study: Yueqi Wang, Simone Chiola, Guang Yang, Chad Russell, Celeste Armstrong, Yuanyuan Wu, Jay Spampanato, Paisley Tarboton, H M Arif Ullah, Nicolas Edgar, Amelia Chang, David Harmin, Vittoria Dickinson Bocchi, Elena Vezzoli, Dario Besusso, Jun Cui, Elena Cattaneo, and Jan Kubanek The study was performed in the department of Neurobiology at University of Utah School of Medicine The study was supported by National Institute of Mental Health (NIMH), National Institute of Neurological Disorders and Stroke (NINDS), University of Utah Health neuroscience initiative and genome project, DevBio T32 training grant. Huge THANK YOU to PMS families that donated cells for our study!

Modeling human telencephalic development and autism-associated SHANK3 deficiency using organoids generated from single neural rosettes - Nature Communications

Modeling human telencephalic development and autism-associated SHANK3 deficiency using organoids generated from single neural rosettes - Nature Communications
12 Comments
Li Li 李力

Li Li 李力

3y
Report this post
Li Li 李力 shared this

Jeong-Yoon Lee

Jeong-Yoon Lee

3y

Li Li 李力 shared this
10 years ago today, my team (Michael Jahrer, Andreas Töscher, Jacob Spoelstra, Lei Shi, Hang Zhang, Jingjing (Bruce) Deng) was awarded the 2nd prize for #KDDCup Track 2 at the KDD opening ceremony in Beijing. This experience shaped my career. ACM SIGKDD & Annual KDD Conference, Kaggle

public_profile__posts
13 Comments
Li Li 李力

Li Li 李力

3y
Report this post
Li Li 李力 shared this
A nice article from Dr. Luciano Abriata featuring our recent work at Google Research. https://lnkd.in/g8dDb68c Link to our paper "Evolving symbolic density functionals" https://lnkd.in/gn948y5T #google #research #symbolicregression #machinelearning #AI4Science #dft

Google proposes new method to derive analytical expressions for terms in quantum mechanics…

Google proposes new method to derive analytical expressions for terms in quantum mechanics…
Li Li 李力

Li Li 李力

4y
Report this post
Li Li 李力 shared this
Can computer construct density functionals in the symbolic form? I would like to share with you our latest work on machine learning applying to density functional theory (DFT) -- Evolving symbolic density functionals (https://lnkd.in/gn948y5T). Inspired by recent advances in AutoML and program synthesis, we proposed a new framework, Symbolic Functional Evolutionary Search (SyFES), that can automatically produce XC functionals with similar simplicity as functionals designed by humans in the past few decades -- symbolic forms with a manageable amount of parameters. This work is machine learning applied to developing human-readable scientific expressions and not just a blackbox prediction. I hope you find it interesting. #machinelearning #symbolic #ai4science #dft

public_profile__posts
9 Comments

Li Li 李力 liked this
Report this post
Li Li 李力 liked this

Lingjuan Peng

Lingjuan Peng

1w

Li Li 李力 liked this
My team is looking for a strong L6+ engineer for Model Quality & Evals. NYC&MTV locations. Please reach out to me directly if you are interested. Model Integration & Tuning: Integrate and tune next-generation GenAI models to optimize Gemini App for maximum conversational fluidity & intelligence. Evaluation Infrastructure: Develop and maintain robust evaluation frameworks, including automated user evaluations (AutoEvals) and product quality scorecards, to monitor and improve end-to-end performance. Feature Engineering: Lead the engineering of advanced conversational capabilities, in coordination with GenAI and infrastructure teams. Regression Analysis: Diagnose and resolve complex model regressions and performance bottlenecks to ensure a reliable and high-quality user experience.
1 Comment
Li Li 李力 liked this
Report this post
Marc Coram

Marc Coram

1w

Li Li 李力 liked this
It has been an exciting journey working on ERA at Google. It provides an easy way to identify and improve code, including data analytic code, for whatever scoreable goal you bring: validation score, run time, etc, or a custom tuned combination, with an agent to help you get it done and tools to search the literature and for ideation. It's been a huge team effort-- thanks all--and I hope we can have a positive impact on Science.

Subhashini Venugopalan

Subhashini Venugopalan

1mo

Li Li 李力 liked this
📣 Our work on ERA (Empirical Research Assistance) has been published in Nature — and we just announced its broad availability via Gemini for Science at Google I/O! 🚀 The project started with a simple premise: What if we could accelerate scientific discovery by automating the creation of scientific software? 💡 Writing scientific tools and the software to support computational experiments is a significant part of the research process — and often a serious bottleneck. It takes substantial iteration (and likely a few PhD theses) to converge. Our team developed ERA, an approach that takes the core principles of iterative search and self-improvement — the same drivers behind breakthroughs like AlphaGo — and applies them to scientific coding. By treating tough challenges as "scorable tasks," an LLM-powered system can systematically explore, test, and optimize its own code to create expert-level tools. Seeing this evolve from small-scale evaluations to real-world impact has been an incredible journey. In our collaborations with scientists, ERA is already delivering superhuman results in fields we care about deeply: 🧬 Bioinformatics 🦠 Epidemiology 🧠 Neuroscience 🌍 Climate & Sustainability The tool is live — and I'm truly excited to see what the scientific community builds with it. 👇 Explore the research and try it yourself: 🧪 Gemini for Science: labs.google/science 📄 Nature Paper: https://lnkd.in/gyN2rxS9 💻 GitHub Repository: https://lnkd.in/gkW9aHQb ✍️ Introductory Blog: https://lnkd.in/gwCY8Emi 💡 Real-World Applications: https://lnkd.in/gMd8TbWr #ArtificialIntelligence #Science #Research #Google DeepMind #Google Research #ScientificDiscovery #AIForScience #GeminiForScience

public_profile__reactions
Li Li 李力 liked this
Report this post
Li Li 李力 liked this

Jerry Yu

Jerry Yu

3w

Li Li 李力 liked this
"Instagram is coming next." — Netflix's co-CEO. Hiring Staff ML Engineer (IC6) to lead recommendations for Instagram on TV (Menlo Park / NYC) "Instagram is coming next." That's how Netflix's co-CEO described the competitive landscape to investors earlier this year. Instagram is moving to the biggest screen in the house, and we're looking for someone to help lead the recommendations behind it. We started testing Instagram on TV late last year, and it's quickly become one of the most exciting bets at Instagram, an early, fast-growing, greenfield surface where you can set technical direction from near zero. Why now: Entertainment has moved back to the living room. Connected TV now captures 58% of digital video time, about 85% of US households have a smart TV, and it's one of the fastest-growing entertainment surfaces in the US. We have the creator ecosystem, the brand, and the technology to compete. And we've only just gotten started, with an ambition to become a top-tier player in TV. What you'd own: Our team builds the recommendation engine and personalization experiences behind Instagram on TV, including the interest-based and social channels (e.g. sports, music, trending) that organize how people discover content on the big screen. People watch differently on a TV — longer, lean-back sessions, less active navigation, co-viewing with the sound on. And Instagram, on both mobile and TV, is shifting towards longer content that fits the format. The opportunity is to build a recommendations stack and TV-native models purpose-built for that experience, from the ground up. This is a staff-level role across the full stack including retrieval, ranking, value modeling, channels, latency, and delivery, with real room to shape architecture and direction, not just execute a roadmap. A few of the open problems: 1. Core media recommendations- optimizing for the unique ways people watch on TV and surfacing the best content 2. Long-form recommendations- bringing TV-native content into recs 3. Growth relevance - accelerating retention by building mobile-to-TV loops 4. LLM-powered recommendations- shipping LLM-powered launches in partnership with research teams Who we're looking for: - Strong background in ML / relevance or large-scale recommendation and ranking systems - A track record of setting technical vision in ambiguous, 0-to-1 spaces — not just executing a defined plan - Demonstrated ability to level up the engineers around you through mentorship, design reviews, and raising the bar - You thrive in a fast-paced, high-impact environment with strong eng and cross-functional partners If this sounds like you (or someone you know), DM me and happy to chat. #MachineLearning #Recommendations #Hiring #Instagram #ConnectedTV #RecSys

public_profile__reactions
2 Comments
Li Li 李力 liked this
Report this post
Li Li 李力 liked this

大大帶我飛 Dadafly

大大帶我飛 Dadafly

1mo

Li Li 李力 liked this
我之前在 Meta 有個強者同事 Yu-Keng 大大到現在做了快九年 FB、IG、Threads 的演算法他不僅在兩年半從 E4 升到 E6 也拿過幾次 Meta 裡少數人才有機會拿到的額外股票 bonus 最近他 39 歲，決定提早退休（FIRE）他算了算存款，覺得夠了因為有些旅行和登山計畫，50 歲以後體力可能就真的跟不上了他最近想要留點時間回饋給社群所以也決定上來我們平台當大大來幫助大家剛上線有開放免費 Coffee Chat！如果你在準備大廠行為面試、工作上遇到問題或是想認真盤一下退休和理財這件事都可以去找他免費聊聊跟大神免費聊天的機會不多，有需要的不要錯過～ 👉 https://lnkd.in/gFZDJSxq

Yu-Keng Shih｜前 Meta Staff Engineer｜39 歲退休｜美國科技面試指導與財務獨立規劃導師 | 大大帶我飛

Yu-Keng Shih｜前 Meta Staff Engineer｜39 歲退休｜美國科技面試指導與財務獨立規劃導師 | 大大帶我飛
6 Comments
Li Li 李力 liked this
Report this post
Li Li 李力 liked this

Nikhil Mehta

Nikhil Mehta

1mo

Li Li 李力 liked this
Excited to share our latest research on GenRetrieval with LLMs! 🚀 Fine-tuning LLMs for specialized GenRetrieval tasks often leads to catastrophic forgetting of their general capabilities. While mixing pretraining data with GenRetrieval data (such as data replay methods) can help mitigate this forgetting, it is often not a viable option. Original pretraining data is frequently proprietary or unavailable , and the computational cost of re-training a conversational agent is highly prohibitive. To solve this, we introduce ORBIT (Origin-Regulated Back-merging of Intermediate Trajectories). ORBIT actively tracks the distance between fine-tuned and initial model weights during training. When parameters stray too far, it dynamically back-merges the original model weights as a regularization step. ORBIT leads to a lightweight, dynamic adaptation method that achieves top-tier performance on specialized tasks without significantly sacrificing foundational reasoning skills—all without needing access to the original training data! Read the full paper here: https://lnkd.in/gVVy7TmH Joint work with: Neha Verma, Shao-Chuan W., Naijing Zhang, Alicia Y. Tsai, Li Wei, Lukasz Heldt, Lichan Hong, Ed H. Chi, Xinyang Yi #GenerativeRetrieval #LLMs #MachineLearning #ArtificialIntelligence #ModelMerging #NLP #Research

ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging

ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging
2 Comments
Li Li 李力 liked this
Report this post
Li Li 李力 liked this

Xianzhe Ma

Xianzhe Ma

3mo

Li Li 李力 liked this
I’m happy to share that I’m starting a new position as Principal Engineer, Gemini App at Google DeepMind!
3 Comments
Li Li 李力 liked this
Report this post
Chi Tang

Chi Tang

3mo

Li Li 李力 liked this
the most complex project to date for me and we did it! #trust #google

Robby Stein

Robby Stein

3mo

Li Li 李力 liked this
Today, we’re expanding Personal Intelligence for AI Mode in Search in the U.S. 🚀 This feature has totally changed how I use AI in Search. Here’s how it works: 1️⃣ Opt in to connect apps like Gmail and Google Photos. Search connects your info to give you responses that uniquely match your specific context and interests. 2️⃣ Start asking for things like ideas and recommendations where this is particularly helpful. For example: I wanted new podcast ideas and Personal Intelligence in AI Mode gave me spot-on recos on AI and product building shows that are similar to some of my favorites (hey @lennysan!). It even found some unexpected options around dad life working in tech. 🎧 This was designed with your privacy and transparency in mind. Connecting to Gmail and Google Photos is secure and it’s off by default. 🔒 If you’re interested in opting in, here is a direct link to update your settings: https://lnkd.in/gNvtAbnw

public_profile__reactions
2 Comments
Li Li 李力 liked this
Report this post
Li Li 李力 liked this

Lingjuan Peng

Lingjuan Peng

3mo

Li Li 李力 liked this
I am #hiring! (Android, iOS, Web, Backend and ML Engineers) The Growth & Discovery team powers the Gemini App's growth engine and crafts user discovery experiences. This mission blends traditional growth engineering—focusing on loops for acquisition, activation, and retention—with the intricate technical task of "Discovery," which involves engineering systems that enable users to explore and maximize AI potential. You will navigate ambiguity to define and build consumer-facing products, working across the stacks. We are looking for engineers with proven track records who combine a passion for building for people with the technical ability to iterate fast. You will collaborate across functions to deliver high-quality, simple, and reusable solutions with a sharp eye for engineering and design craft. Please fill out the candidates form if you are interested. https://lnkd.in/gkyrTqS5
8 Comments

See all activities

Experience & Education

Meta

******

******** ********
******* ****

******* ******** *********
** ******

****** ** ********** ******* ********** *** ******** ******* ******** ********** ** ******* ********** ****** undefined

2011 - 2016
***** **********

********** ****** *******

2007 - 2011

View Li’s full experience

See their title, tenure and more.

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Publications

Optimization of molecules via deep reinforcement learning

Scientific Reports 2019
We present a framework, which we call Molecule Deep Q-Networks (MolDQN), for molecule optimization by combining domain knowledge of chemistry and state-of-the-art reinforcement learning techniques (double Q-learning and randomized value functions). We directly define modifications on molecules, thereby ensuring 100% chemical validity. Further, we operate without pre-training on any dataset to avoid possible bias from the choice of that set. MolDQN achieves comparable or better performance…

We present a framework, which we call Molecule Deep Q-Networks (MolDQN), for molecule optimization by combining domain knowledge of chemistry and state-of-the-art reinforcement learning techniques (double Q-learning and randomized value functions). We directly define modifications on molecules, thereby ensuring 100% chemical validity. Further, we operate without pre-training on any dataset to avoid possible bias from the choice of that set. MolDQN achieves comparable or better performance against several other recently published algorithms for benchmark molecular optimization tasks. However, we also argue that many of these tasks are not representative of real optimization problems in drug discovery. Inspired by problems faced during medicinal chemistry lead optimization, we extend our model with multi-objective reinforcement learning, which maximizes drug-likeness while maintaining similarity to the original molecule. We further show the path through chemical space to achieve optimization for a molecule to understand how the model works.

Other authors
See publication
Can exact conditions improve machine-learned density functionals?

The Journal of Chemical Physics 2018
Historical methods of functional development in density functional theory have often been guided by analytic conditions that constrain the exact functional one is trying to approximate. Recently, machine-learned functionals have been created by interpolating the results from a small number of exactly solved systems to unsolved systems that are similar in nature. For a simple one-dimensional system, using an exact condition, we find improvements in the learning curves of a machine learning…

Historical methods of functional development in density functional theory have often been guided by analytic conditions that constrain the exact functional one is trying to approximate. Recently, machine-learned functionals have been created by interpolating the results from a small number of exactly solved systems to unsolved systems that are similar in nature. For a simple one-dimensional system, using an exact condition, we find improvements in the learning curves of a machine learning approximation to the non-interacting kinetic energy functional. We also find that the significance of the improvement depends on the nature of the interpolation manifold of the machine-learned functional.

Other authors
See publication
Efficient prediction of 3D electron densities using machine learning

NeurIPS 2018 Workshop on Machine Learning for Molecules and Materials 2018
The Kohn-Sham scheme of density functional theory is one of the most widely used methods to solve electronic structure problems for a vast variety of atomistic systems across different scientific fields. While the method is fast relative to other first principles methods and widely successful, the computational time needed is still not negligible, making it difficult to perform calculations for very large systems or over long time-scales. In this submission, we revisit a machine learning model…

The Kohn-Sham scheme of density functional theory is one of the most widely used methods to solve electronic structure problems for a vast variety of atomistic systems across different scientific fields. While the method is fast relative to other first principles methods and widely successful, the computational time needed is still not negligible, making it difficult to perform calculations for very large systems or over long time-scales. In this submission, we revisit a machine learning model capable of learning the electron density and the corresponding energy functional based on a set of training examples. It allows us to bypass solving the Kohn-Sham equations, providing a significant decrease in computation time. We specifically focus on the machine learning formulation of the Hohenberg-Kohn map and its decomposability. We give results and discuss challenges, limits and future directions.

Other authors
See publication
Tensor Field Networks: Rotation-and Translation-Equivariant Neural Networks for 3D Point Clouds

arXiv:1802.08219 2018
We introduce tensor field neural networks, which are locally equivariant to 3D rotations, translations, and permutations of points at every layer. 3D rotation equivariance removes the need for data augmentation to identify features in arbitrary orientations. Our network uses filters built from spherical harmonics; due to the mathematical consequences of this filter choice, each layer accepts as input (and guarantees as output) scalars, vectors, and higher-order tensors, in the geometric sense…

We introduce tensor field neural networks, which are locally equivariant to 3D rotations, translations, and permutations of points at every layer. 3D rotation equivariance removes the need for data augmentation to identify features in arbitrary orientations. Our network uses filters built from spherical harmonics; due to the mathematical consequences of this filter choice, each layer accepts as input (and guarantees as output) scalars, vectors, and higher-order tensors, in the geometric sense of these terms. We demonstrate the capabilities of tensor field networks with tasks in geometry, physics, and chemistry.

Other authors
See publication
Bypassing the Kohn-Sham equations with machine learning

Nature Communications October 1, 2017
Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields, ranging from materials science to biochemistry to astrophysics. Machine learning holds the promise of learning the kinetic energy functional via examples, by-passing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing either larger systems or longer time-scales to…

Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields, ranging from materials science to biochemistry to astrophysics. Machine learning holds the promise of learning the kinetic energy functional via examples, by-passing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing either larger systems or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. Both improved accuracy and lower computational cost with this method are demonstrated by reproducing DFT energies for a range of molecular geometries generated during molecular dynamics simulations. Moreover, the methodology could be applied directly to quantum chemical calculations, allowing construction of density functionals of quantum-chemical accuracy.

Other authors
See publication
Lazy stochastic principal component analysis

IEEE International Conference on Data Mining Workshop October 1, 2017
Stochastic principal component analysis (SPCA) has become a popular dimensionality reduction strategy for large, high-dimensional datasets. We derive a simplified algorithm, called Lazy SPCA, which has reduced computational complexity and is better suited for large-scale distributed computation. We prove that SPCA and Lazy SPCA find the same approximations to the principal subspace, and that the pairwise distances between samples in the lower-dimensional space is invariant to whether SPCA is…

Stochastic principal component analysis (SPCA) has become a popular dimensionality reduction strategy for large, high-dimensional datasets. We derive a simplified algorithm, called Lazy SPCA, which has reduced computational complexity and is better suited for large-scale distributed computation. We prove that SPCA and Lazy SPCA find the same approximations to the principal subspace, and that the pairwise distances between samples in the lower-dimensional space is invariant to whether SPCA is executed lazily or not. Empirical studies find downstream predictive performance to be identical for both methods, and superior to random projections, across a range of predictive models (linear regression, logistic lasso, and random forests). In our largest experiment with 4.6 million samples, Lazy SPCA reduced 43.7 hours of computation to 9.9 hours. Overall, Lazy SPCA relies exclusively on matrix multiplications, besides an operation on a small square matrix whose size depends only on the target dimensionality.

Other authors
See publication
Pure density functional for strong correlations and the thermodynamic limit from machine learning

Phys. Rev. B 2016
We use the density-matrix renormalization group, applied to a one-dimensional model of continuum Hamiltonians, to accurately solve chains of hydrogen atoms of various separations and numbers of atoms. We train and test a machine-learned approximation to F[n], the universal part of the electronic density functional, to within quantum chemical accuracy. We also develop a data-driven, atom-centered basis set for densities which greatly reduces the computational cost and accurately represents the…

We use the density-matrix renormalization group, applied to a one-dimensional model of continuum Hamiltonians, to accurately solve chains of hydrogen atoms of various separations and numbers of atoms. We train and test a machine-learned approximation to F[n], the universal part of the electronic density functional, to within quantum chemical accuracy. We also develop a data-driven, atom-centered basis set for densities which greatly reduces the computational cost and accurately represents the physical information in the machine-learning calculation. Our calculation (a) bypasses the standard Kohn-Sham approach, avoiding the need to find orbitals, (b) includes the strong correlation of highly stretched bonds without any specific difficulty (unlike all standard DFT approximations), and (c) is so accurate that it can be used to find the energy in the thermodynamic limit to quantum chemical accuracy.

Other authors
See publication
Understanding kernel ridge regression: Common behaviors from simple functions to density functionals

Int. J. Quant. Chem. 2015
Accurate approximations to density functionals have recently been obtained via machine learning (ML). By applying ML to a simple function of one variable without any random sampling, we extract the qualitative dependence of errors on hyperparameters. We find universal features of the behavior in extreme limits, including both very small and very large length scales, and the noise-free limit. We show how such features arise in ML models of density functionals.

Other authors
See publication
Understanding machine-learned density functionals

Int. J. Quant. Chem. 2015
Kernel ridge regression is used to approximate the kinetic energy of non-interacting fermions in a one-dimensional box as a functional of their density. The properties of different kernels and methods of cross-validation are explored, and highly accurate energies are achieved. Accurate {\em constrained optimal densities} are found via a modified Euler-Lagrange constrained minimization of the total energy. A projected gradient descent algorithm is derived using local principal component…

Kernel ridge regression is used to approximate the kinetic energy of non-interacting fermions in a one-dimensional box as a functional of their density. The properties of different kernels and methods of cross-validation are explored, and highly accurate energies are achieved. Accurate {\em constrained optimal densities} are found via a modified Euler-Lagrange constrained minimization of the total energy. A projected gradient descent algorithm is derived using local principal component analysis. Additionally, a sparse grid representation of the density can be used without degrading the performance of the methods. The implications for machine-learned density functional approximations are discussed.

Other authors
- $Klaus-Robert M\"{u}ller$
See publication
Graded index photonic hole: Analytical and rigorous full wave solution

Physical Review B August 27, 2010

We present a rigorous full wave approach to the omnidirectional photonic hole
(PH), an optical system inspired by celestial phenomena and characterized by a radially
graded refractive index n (r)∼ 1/r α/2. It is analytically demonstrated that light capture is
effective for α≥ α c= 2. Our analyses are corroborated by precise numerical simulations of
steady-state and time-evolution behaviors.

See publication

Join now to see all publications

Patents

Protecting devices from malicious files based on n-gram processing of sequential data

Issued January 1, 2018 US 15490797
Under one aspect, a method is provided for protecting a device from a malicious file. The method can be implemented by one or more data processors forming part of at least one computing device and can include extracting from the file, by at least one data processor, sequential data comprising discrete tokens. The method also can include generating, by at least one data processor, n-grams of the discrete tokens. The method also can include generating, by at least one data processor, a vector of…

Under one aspect, a method is provided for protecting a device from a malicious file. The method can be implemented by one or more data processors forming part of at least one computing device and can include extracting from the file, by at least one data processor, sequential data comprising discrete tokens. The method also can include generating, by at least one data processor, n-grams of the discrete tokens. The method also can include generating, by at least one data processor, a vector of weights based on respective frequencies of the n-grams. The method also can include determining, by at least one data processor and based on a statistical analysis of the vector of weights, that the file is likely to be malicious. The method also can include initiating, by at least one data processor and responsive to determining that the file is likely to be malicious, a corrective action.

Other inventors
See patent

Projects

State Farm Distracted Driver Detection @ Kaggle.com

Aug 2016

Using deep learning to detect drivers' distracted behaviors automatically from dashboard cameras.
- Rank 90th/1440. (top 7%)
- Because there are only 26 unique drivers in the training set, it is very easy to overfit. Two pre-trained model are used.
- Fine tuning the VGG-16 and VGG-19 network with different cross-validation strategy.
- Ensemble 4 best convolutional neural network models.

See project
Facebook V: Predicting Check Ins @Kaggle.com

Jul 2016

Identify the correct place for check ins in an artificial world consisting of more than 100,000 places located in a 10 km by 10 km square.
- Rank 5th/1212.
- Write own framework code for easy and fast model selection and ensemble.
- Ensemble model of k-nearest neighbors, random forest, extra-trees, gradient boosting trees, naive bayes, kernel density estimation.
- Detail solution…

Identify the correct place for check ins in an artificial world consisting of more than 100,000 places located in a 10 km by 10 km square.
- Rank 5th/1212.
- Write own framework code for easy and fast model selection and ensemble.
- Ensemble model of k-nearest neighbors, random forest, extra-trees, gradient boosting trees, naive bayes, kernel density estimation.
- Detail solution explanation:
https://www.kaggle.com/c/facebook-v-predicting-check-ins/forums/t/22112/5th-place-solution

See project
Expedia Hotel Recommendations @ Kaggle.com

Jun 2016
Contextualize customer data and predict the likelihood a user will stay at 100 different hotel groups.
- Top 2%. Rank 40th/1974.

Other creators
See project
Home Depot Product Search Relevance @Kaggle.com

Apr 2016
Predict the relevance of search results from product title, description, search_term and attribute files.
- Top 2%. Rank 44th/2125.
- Impute important data (e.g. brand, material...) by customized local dictionary from existing data. Improve the percentage of data having brand from 80.9% to 99.4%.
- Achieve professional and accurate spell correction by crawling google. Correct 13% typos in search term.
- Feature engineering from natural language. Including semantic analysis, word…

Predict the relevance of search results from product title, description, search_term and attribute files.
- Top 2%. Rank 44th/2125.
- Impute important data (e.g. brand, material...) by customized local dictionary from existing data. Improve the percentage of data having brand from 80.9% to 99.4%.
- Achieve professional and accurate spell correction by crawling google. Correct 13% typos in search term.
- Feature engineering from natural language. Including semantic analysis, word vectors (from spacy and local data TF-IDF, bag of words), string distance (cosine similarity, Dice distance, Jacquard distance), statistics distance, cooccurrence. Inter-feature distributions and intra-feature distributions are considered for distance measure.
- Customize stratified cross validation. Reduce variance by over half (~0.0040 to ~0.0017).
- Take advantage of different models: gradient boosting tree (xgboost), neural network (keras), random forest (sklearn), ridge regression (sklearn) and lasso regression (sklearn).
- Optimize each model with automatic parameter selection processes (hyperopt).
- Ensemble by stacking metafeatures and important raw features. Metafeatures are from the prediction of 15 models and important raw features are 959 features with correlation to label greater than 0.05.

Other creators
See project
Airbnb: New User Bookings @Kaggle.com

Feb 2016

Predict users' first booking destinations from user profiles and web sessions logs.
- Rank 43rd/1463. Top 2.9%
- Improved the accuracy of gradient boosting tree algorithms (xgboost) and random forest (sklearn) predictions by feature selection and engineering mainly on age, timestamp and sessions data.
- Apply n-gram, tf-idf, NMF and PCA to extract features from web sessions data.
- Ensemble model by bagging and stacking.

See project
Prudential Life Insurance Assessment: Classifying Risk @Kaggle.com

Feb 2016

Developing a predictive model that accurately classifies risk 1 - 8 from over a hundred variables describing attributes of life insurance applicants.
-Rank 158th/2613. Top 6%.
-For this data set, xgboost performance is very sensitive to hyperparameters. Apply stacking to eliminate to the sensitivity of parameter and reduce the risk of overfitting. Local cross validation scores improve from ~0.61 to ~0.64.
-As an ordinal regression problem, improves the offset optimization by 3 fold…

Developing a predictive model that accurately classifies risk 1 - 8 from over a hundred variables describing attributes of life insurance applicants.
-Rank 158th/2613. Top 6%.
-For this data set, xgboost performance is very sensitive to hyperparameters. Apply stacking to eliminate to the sensitivity of parameter and reduce the risk of overfitting. Local cross validation scores improve from ~0.61 to ~0.64.
-As an ordinal regression problem, improves the offset optimization by 3 fold cross validation with back and forth scanning. Local score improves to ~0.688. Bagging 5 models with different random seeds to improve stability.

See project

Honors & Awards

Kaggle Master

Kaggle

Jul 2016

A Kaggle competitor with consistent and stellar competition results.

Consistency: at least 2 Top 10% finishes in public competitions
Excellence: at least 1 of those finishes in the top 10 positions
The Regents’ Fellowship

University of California, Irvine

Sep 2012
The Regents’ Fellowship

University of California, Irvine

Sep 2011
Chinese National Scholarship

Ministry of Education of the People's Republic of China

Oct 2010

Languages

English

-
Chinese

-
Shanghainese

-

View Li’s full profile

See who you know in common
Get introduced
Contact Li directly

Join to view full profile

Other similar profiles

Drishan Arora

Drishan Arora

Deep Cogito

7K followers
San Francisco Bay Area

View Profile
Zixuan You

Zixuan You

University of California, Santa Cruz

5K followers
San Francisco Bay Area

View Profile
Lalit Kundu

Lalit Kundu

Forbes Technology Council

38K followers
San Francisco, CA

View Profile
Jiaqi Zhang

Jiaqi Zhang

Google

7K followers
Mountain View, CA

View Profile
Yi Cui

Yi Cui

CSC Generation

7K followers
Palo Alto, CA

View Profile
Shiva Mahajan

Shiva Mahajan

Google

10K followers
Sunnyvale, CA

View Profile
Yiwen Chen

Yiwen Chen

Google

8K followers
Mountain View, CA

View Profile
Chun Wang

Chun Wang

Google

8K followers
Mountain View, CA

View Profile
Sasan Tavakkol

Sasan Tavakkol

Google

7K followers
Irvine, CA

View Profile
Mrinal Ahlawat

Mrinal Ahlawat

Google

12K followers
San Francisco Bay Area

View Profile
Saumya Pathak

Saumya Pathak

Google

979 followers
London

View Profile
Ji Xue

Ji Xue

Google

3K followers
New York, NY

View Profile
Suleman Kazi

Suleman Kazi

Airbnb

9K followers
New York City Metropolitan Area

View Profile
Sheetal Shalini

Sheetal Shalini

Google

3K followers
San Francisco Bay Area

View Profile
Cicy T.

Cicy T.

Google

1K followers
Mountain View, CA

View Profile
Khasan Bold

Khasan Bold

Google

5K followers
Sunnyvale, CA

View Profile
Jing Qi

Jing Qi

Google

8K followers
Sunnyvale, CA

View Profile
Jaclyn Coleman

Jaclyn Coleman

Google

885 followers
Austin, TX

View Profile
Maral Mesma khosroshahi

Maral Mesma khosroshahi

Microsoft

3K followers
Santa Clara, CA

View Profile
Chao Teng

Chao Teng

Facebook

7K followers
New York, NY

View Profile

Explore more posts

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses

See all courses

See your mutual connections View mutual connections with Li Li can introduce you to 10+ people at Meta Sign in with Email or New to LinkedIn? Join now By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

About

Activity

4K followers

Ying Guo

Danielle Tay Mendez

Li Li 李力

Li Li 李力

Zhe Zhang

Li Li 李力

Alex Shcheglovitov 🇺🇦

Li Li 李力

Jeong-Yoon Lee

Li Li 李力

Li Li 李力

Lingjuan Peng

Marc Coram

Subhashini Venugopalan

Jerry Yu

大大帶我飛 Dadafly

Nikhil Mehta

Xianzhe Ma

Chi Tang

Robby Stein

Lingjuan Peng

Experience & Education

Meta

******** ********

View Li’s full experience

See their title, tenure and more.

Publications

Scientific Reports 2019

The Journal of Chemical Physics 2018

NeurIPS 2018 Workshop on Machine Learning for Molecules and Materials 2018

arXiv:1802.08219 2018

Nature Communications October 1, 2017

IEEE International Conference on Data Mining Workshop October 1, 2017

Phys. Rev. B 2016

Int. J. Quant. Chem. 2015

Int. J. Quant. Chem. 2015

Physical Review B August 27, 2010

Patents

Issued January 1, 2018 US 15490797

Projects

Aug 2016

Jul 2016

Jun 2016

Apr 2016

Feb 2016

Feb 2016

Honors & Awards

Kaggle Master

Kaggle

The Regents’ Fellowship

University of California, Irvine

The Regents’ Fellowship

University of California, Irvine

Chinese National Scholarship

Ministry of Education of the People's Republic of China

Languages

English

-

Chinese

-

Shanghainese

-

View Li’s full profile

Other similar profiles

Drishan Arora

Zixuan You

Lalit Kundu

Jiaqi Zhang

Yi Cui

Shiva Mahajan

Yiwen Chen

Chun Wang

Sasan Tavakkol

Mrinal Ahlawat

Saumya Pathak

View mutual connections with Li

Li can introduce you to 10+ people at Meta

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.