About
Activity
2K followers
Experience & Education
Publications
-
Hashed Dynamic Blocking for Very Large Databases
DINA '20 at European Conference on Machine Learning ECML
-
SuperPart: Supervised graph partitioning for record linkage
2018 IEEE International Conference on Data Mining (ICDM'18)
-
Design Challenges in Named Entity Transliteration
Proceedings of COLING 2018, the 27th International Conference on Computational Linguistics
-
Improving Accuracy of Patient Demographic Matching and Identity Resolution
The University of Memphis, ProQuest Dissertations Publishing
See publicationPhD Dissertation
-
Grapheme to Phoneme Translation using Conditional Random Fields with Re-ranking
TSD 2016 Proceedings of the 19th International Conference on Text, Speech and Dialogue
-
Incorporating Syllable Phonotactics to Improve Grapheme to Phoneme Translation
Future and Emerging Trends in Language Technologies FETLT 2016
-
Embracing the Sparse, Noisy, and Interrelated Aspects of Patient Demographics for use in Clinical Medical Record Linkage
AMIA Summits on Clinical Research Informatics 2015
-
The Discriminating Power of Information within Patient Demographics for Clinical Medical Record Linkage
GIS '14 Global Identity Summit
(Invited Talk)
-
Optimizing database index performance for solid state drives
IDEAS '14 Proceedings of the 18th International Database Engineering & Applications Symposium
Patents
-
Natural Language Query Processing
Issued US12265528B1
See patentTechniques for handling natural language query processing are described. In some examples, a sequence-to-sequence model is used to handle a natural language query. Post-processing of a result of the sequence-to-sequence model utilizes fine-grained information from an entity linker. In some examples, the sequence-to-sequence model and aspects of a natural language query pipeline are used to handle a natural language query.
-
Row level security in natural language question answering
Issued US 12223080
This disclosure describes a natural language question (NLQ) query service within a service provider network that provides row level security (RLS) for autocomplete during entry of NLQs and fuzzy matching in NLQ answering. The rules take the form of per-user predicates such as Tim can only see rows with region= US. In configurations a complex extraction and preprocessing pipeline to extract distinct combinations of values against RLS predicate “rule keys” is used. Those distinct values are…
This disclosure describes a natural language question (NLQ) query service within a service provider network that provides row level security (RLS) for autocomplete during entry of NLQs and fuzzy matching in NLQ answering. The rules take the form of per-user predicates such as Tim can only see rows with region= US. In configurations a complex extraction and preprocessing pipeline to extract distinct combinations of values against RLS predicate “rule keys” is used. Those distinct values are indexed along with grouped rule keys to enable pushing down predicates at auto-complete time. This enables pushing part of RLS rule handling to ingestion time of a dataset rather than handling all RLS rule handling at query time, enabling meeting of latency goals. In configurations, a single logical document of unique cell values is split into multiple documents with a subset of rule keys to handle scalability limits.
-
INTERACTIVE ASSISTANCE FOR EXECUTING NATURAL LANGUAGE QUERIES TO DATA SETS
Issued US 11,604,794
-
Supervised graph partitioning for record matching
Issued US 11514054
-
MEMORY-EFFICIENT STREAMING COUNT ESTIMATION FOR MULTISETS
Issued US 11,314,730
-
Scalable Parallel Elimination of Approximately Subsumed Sets
Issued US 11,086,940
-
System and Method of Partitioned Lexicographic Search
Issued US US9129010 B2
-
Semantic Address Parsing Using a Graphical Discriminative Probabilistic Model
Filed US US 20160147943
The system comprises a processor, a memory, and an application that comprises a semantic address parser that incorporates a graphical discriminative probabilistic model. When executed by the processor the application receives an address as input comprising tokens and for each token identifies a feature value of at least one feature associated with the token. The application analyzes the feature values to determine an address label for each token and based on the address labels of the tokens…
The system comprises a processor, a memory, and an application that comprises a semantic address parser that incorporates a graphical discriminative probabilistic model. When executed by the processor the application receives an address as input comprising tokens and for each token identifies a feature value of at least one feature associated with the token. The application analyzes the feature values to determine an address label for each token and based on the address labels of the tokens, converts the input patient address to a canonical address format.
Other inventorsSee patent
Projects
-
RoboPhd
-
JOpenFST - Java Weighted Finite State Transducers
- Present
See projectjava implementation of WFSTs inspired by openfst
-
Invited participant Queer Health Hackathon - Broad Institute of MIT and Harvard
-
Weekend bringing together clinicians, health care policy experts, and data scientists to dig into an EHR dataset to discover preliminary findings about healthcare quality, access, and outcomes for the LGBTQ population.
Other creators -
Honors & Awards
-
2nd Place Student Research Symposium
University of Memphis
Placed 2nd for research on the usage of neural networks for image analysis and feature extraction of check images in the banking industry
-
Metavante Key Results
Metavante
Awarded to individuals who are key resources in contributing to the success of the companies strategic goals
-
Presidents Award
Tristate Independent Theatre Association
Awarded to a single individual each year for leadership and contribution to business initiatives.
-
Jack U. Russell Award for Outstanding Work In Computer Science
Rhodes College, Math Department
Awarded by the Math and Computer Science department at Rhodes College to a single student for outstanding work during their undergraduate career. The recipient is nominated and decided by the faculty.
-
Outstanding Work in First-year Computer Science
Rhodes College, Math Department
Award given to a single first-year student in Computer Science by the Math and Computer Science department
Recommendations received
3 people have recommended Steve
Join now to viewOther similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content