Karthik Uppuluri

About Me

Hey there! I'm fascinated by how machines can understand human language, and that's what I do for a living. I build systems that can read, understand, and answer questions just like humans do (well, almost!). Think of it as teaching computers to be better conversation partners.

While I geek out on Machine Learning and AI in general, what really gets me excited is working with different languages. Telugu, my mother tongue, holds a special place in my heart. I'm on a mission to make AI work better for languages that big tech often overlooks. It's like bringing the power of AI to the languages that deserve more attention!

When I'm not diving deep into code or playing with language models, you'll find me hitting the pavement for a long run or strumming away on my string instruments. Running helps me clear my head, and music... well, let's just say my instruments are patient with my amateur attempts at making music! 😄

Experience

University of Houston
SAP
Fidelity

Patents

Cognitive Enterprise System

Granted

Systems and methods for intelligent query processing using natural language understanding and knowledge graphs. Tokenizes user queries, identifies intents, and executes relevant functions to generate contextual responses using enterprise data sources.

Contextual Natural Language Processing In Enterprise Applications

Granted

Converts natural language instructions into executable software transactions. Uses NLP to transform user inputs into application-specific formats, filters based on context parameters, and matches to appropriate transactions for execution.

Large Language Model Guardrails

Filed

Safety and control mechanisms for large language model deployments in enterprise environments.

Publications

Research Publications

Text2Zinc: A Cross-Domain Dataset for Modeling Optimization and Satisfaction Problems in MiniZinc

INFORMS 2025

A novel cross-domain dataset for bridging natural language and constraint optimization modeling using MiniZinc.

View on arXiv

Ner4Opt: Named Entity Recognition for Optimization Modelling from Natural Language

Constraints 2024

Journal paper presenting a named entity recognition approach to extract optimization components from natural language problem descriptions.

View on Springer

Ner4Opt: Named Entity Recognition for Optimization Modelling from Natural Language

CPAIOR 2023

Conference paper introducing the Ner4Opt framework for automated optimization model generation from natural language.

View on Springer

Natural Language for Optimization

NeurIPS 2022

Poster presentation on leveraging natural language processing for optimization problem formulation.

View Poster (PDF)

Citations & Presentations

Talks & Presentations

Telugu Language Modeling & NLP Resources

Manabadi Sadassu • July 2024

Presentation on Telugu corpus database, web applications for Telugu NLP, and demos of language models for the Telugu community.

Generative AI

All Things Open RTP Meetup • May 2023

Exploring the landscape of generative AI technologies, applications, and implications for developers and enterprises.

Open Source AI Models

Telugu BERT

kuppuluri/telugu_bertu

9,214

First-ever BERT Masked Language Model trained on the Telugu language from scratch. A foundational model for Telugu NLP tasks, enabling transfer learning for various downstream applications.

BERT, Telugu, MLM, Transformers
View on Hugging Face
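
A minimal sketch of trying the masked language model with the Hugging Face `transformers` library. The model ID comes from the listing above; the helper names `mask_word` and `predict_masked`, the example sentence, and the assumption that the tokenizer uses the standard BERT `[MASK]` token are illustrative and not taken from the model card. Loading the model downloads it from the Hugging Face Hub.

```python
def mask_word(sentence: str, word: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `word` with the mask token.

    Assumption: the tokenizer uses the usual BERT mask token "[MASK]".
    """
    if word not in sentence:
        raise ValueError(f"{word!r} not found in sentence")
    return sentence.replace(word, mask_token, 1)


def predict_masked(masked_sentence: str, top_k: int = 5):
    """Return (token, score) pairs for a sentence containing one mask token."""
    # Imported lazily so mask_word works without transformers installed.
    from transformers import pipeline

    fill = pipeline("fill-mask", model="kuppuluri/telugu_bertu")
    predictions = fill(masked_sentence, top_k=top_k)
    return [(p["token_str"], p["score"]) for p in predictions]


# Hypothetical example sentence ("I speak Telugu") with one word masked.
masked = mask_word("నేను తెలుగు మాట్లాడతాను", "తెలుగు")

# Requires network access and the transformers package:
# for token, score in predict_masked(masked):
#     print(token, round(score, 3))
```

The predictions are the model's candidate fillers for the masked position, ranked by score.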

Telugu Comprehension

kuppuluri/telugu_bertu_tydiqa

1,821

Telugu Question-Answering model fine-tuned on Google's TyDiQA dataset. Enables extractive question answering for Telugu text, advancing comprehension capabilities for low-resource languages.

Question Answering, Telugu, TyDiQA, BERT
View on Hugging Face
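
A minimal sketch of extractive question answering with this model through the `transformers` question-answering pipeline. "Extractive" means the answer is a character span copied out of the supplied context, not generated text. The helper names `answer_question` and `is_extractive` are hypothetical, and the sample result dictionary is made up for illustration only; it is not real model output.

```python
def answer_question(question: str, context: str) -> dict:
    """Run extractive QA; the result has 'answer', 'start', 'end', 'score'."""
    # Imported lazily so is_extractive works without transformers installed.
    from transformers import pipeline

    qa = pipeline("question-answering", model="kuppuluri/telugu_bertu_tydiqa")
    return qa(question=question, context=context)


def is_extractive(result: dict, context: str) -> bool:
    """Check that the answer is exactly the context span [start, end)."""
    return context[result["start"]:result["end"]] == result["answer"]


# Illustrative, hand-written result in the pipeline's output shape.
context = "Hyderabad is the capital of Telangana."
fake_result = {"answer": "Hyderabad", "start": 0, "end": 9, "score": 0.9}
span_ok = is_extractive(fake_result, context)

# Requires network access and the transformers package; pass Telugu text
# for both arguments in practice:
# result = answer_question(question="...", context="...")
# print(result["answer"], is_extractive(result, context="..."))
```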

More Models & Demos

For the latest models, demos, and experiments, visit my Hugging Face profiles.

Academic Service

Invited Reviewer

NeurIPS 2025

Neural Information Processing Systems - Premier ML/AI Conference

View Program Committee

Invited Reviewer

AlphaXiv

AI-powered preprint review platform

View Profile

Open Source Libraries

Created & Maintained

Ner4Opt

Active

Named Entity Recognition for Optimization - A framework for extracting optimization components from natural language problem descriptions and generating constraint programming models.

Python, NLP, Optimization, NER

Telugu Language Modeling

Active

Experiments in building language models for Telugu from scratch - exploring low-resource NLP techniques for underrepresented languages.

Python, NLP, Telugu, Language Modeling

Contributed

TextWiser

Active

Text Featurization Library - A flexible and easy-to-use library for text featurization with support for various embedding techniques and transformers. Developed at Fidelity Investments.

Python, NLP, Text Embeddings, Machine Learning

Hobbies

Running
Guitar
Ukulele