Research & Experience

Graduate Research Assistant

Artificial Intelligence Institute, University of South Carolina

Columbia, SC, USA

Aug 2019 – Present

Leading research and development on next-generation knowledge graph platforms and agentic AI systems at the AI Institute.

EMPWR: The Next Gen Knowledge Graph Platform

  • Led the development and release of EMPWR, a comprehensive platform that manages the end-to-end KG lifecycle, including design, ingestion, enrichment, and maintenance
  • Built modular workflows for extracting knowledge from unstructured, semi-structured, and structured data sources
  • Large-Scale KG Construction: Developed PercuroKG (in collaboration with Wipro), a pharmaceutical KG consisting of >6 million triples, 1.5 million nodes, and 3,000 relation types

C3AN: Compact, Custom, and Composite AI Systems

  • Designing MCP servers that support agentic system integration for planning and orchestrating AI workflows
  • Providing technical leadership and mentoring in multimodal KG construction

Graduate Teaching Assistant

  • Teaching assistant for CSCE 145: Algorithm Design I and CSCE 146: Algorithm Design II

AI Advisor

MedHive.ai

Atlanta, GA, USA

Aug 2025 – Present

Leading the development of a neuro-symbolic AI approach that combines small language models and knowledge graphs to ingest medical device documentation, process guidelines, and patent submissions, with the goal of accelerating and automating medical device R&D.

  • Directed collaborative data science discovery projects with UC Berkeley to develop a medical device knowledge graph that supports complex multi-hop reasoning and information retrieval with provenance from large-scale FDA 510(k) datasets
  • Engineered a multi-strategy (document/graph) RAG approach with LLM-as-a-Judge to improve response traceability
  • Designed a KG-LLM response fidelity score to evaluate and audit the lineage of agent outputs and reduce hallucinations in medical device QA

Data Science Research Intern

Outreach.io

Seattle, WA, USA

May 2020 – Aug 2021

Pioneered knowledge graph initiatives to transform sales engagement data into actionable intelligence for enterprise sales teams.

  • Designed the Sales Engagement Ontology to unify semantic standards between data producers and consumers
  • Pioneered the Sales Engagement Knowledge Graph (SEG) to surface information about the people involved in sales activities from 4.7M engagement logs; improved sales-rep-to-lead connection throughput by 20% (~64K newly discovered people entities)
  • Modeled non-sequential sales processes with multi-label temporal graphs, transitioning from a linear "stages" view to a multi-dimensional "activity" view to improve context and next-best-action recommendations

Research Intern

National Library of Medicine, NIH

Bethesda, MD, USA

May 2019 – Jan 2022

Developed context-enriched deep learning models to align knowledge from >200 heterogeneous sources for the UMLS Metathesaurus; streamlined the maintenance effort with humans in the loop.

  • Benchmarked various Knowledge Graph Embedding techniques and Siamese Network architectures to scale biomedical vocabulary alignment across 172 million term pairs; achieved a 5.0% increase in precision (>94% F1) and a 50% reduction in false positive rates compared to lexical baselines
  • Optimized large-scale training pipelines using Keras, TensorFlow, and Slurm on high-performance computing clusters, using over 2,700 GPU hours for model training and validation

Graduate Research Assistant

Kno.e.sis Center, Wright State University

Dayton, OH, USA

May 2017 – Apr 2019

Project kHealth: Semantic Multisensory Mobile Approach to Personalized Asthma Care

Worked with SMEs to design a Personalized Health Knowledge Graph that collects and integrates data from multimodal streams (clinical notes, a mobile health application, and outdoor environmental observations); >30 parameters involving up to 1,852 data points/day, collected over 1- or 3-month patient participation periods.

  • Developed conversational agents to improve patient engagement and personalized care in pediatric asthma management; achieved over 75% patient compliance (110 of 150 patients in the study cohort)

Graduate Teaching Assistant

  • Teaching assistant for CS 1160: Introduction to Computer Programming

Publications

20+ Total Publications
8 Journal Papers
3 Conference Papers
8 Workshop Papers
1 Patent

Journal Papers

2025

Building Multimodal Knowledge Graphs: Automation for Enterprise Integration

Garimella, R., Yip, H.Y., Venkataramanan, R., & Sheth, A.P.

IEEE Internet Computing, 29(3), 76-84, 2025

2024

The EMPWR Platform: Data and Knowledge-Driven Processes for the Knowledge Graph Lifecycle

Yip, H.Y. & Sheth, A.

IEEE Internet Computing, 28(1), 61-69, 2024

2023

Knowledge Graph Empowered Machine Learning Pipelines for Improved Efficiency, Reusability, and Explainability

Venkataramanan, R., Tripathy, A., Foltin, M., Yip, H.Y., Justine, A., & Sheth, A.

IEEE Internet Computing, 27(1), 81-88, 2023

2019

Extending Patient-Chatbot Experience with Internet-of-Things and Background Knowledge

Sheth, A., Yip, H.Y., & Shekarpour, S.

IEEE Intelligent Systems, 34(4), 24-30, 2019

2019

kBot: Knowledge-enabled Personalized Chatbot for Asthma Self-Management

Kadariya, D., Venkataramanan, R., Yip, H.Y., Kalra, M., Thirunarayan, K., & Sheth, A.

IEEE International Conference on Smart Computing (SMARTCOMP), 138-143, 2019

2019

Determination of Personalized Asthma Triggers From Multimodal Sensing and a Mobile App

Venkataramanan, R., Kadariya, D., Yip, H.Y., Jaimini, U., Thirunarayan, K., Kalra, M., & Sheth, A.

JMIR Pediatrics and Parenting, 2(1), 2019

2019

Cognitive Services and Intelligent Chatbots: Current Perspectives and Special Issue Introduction

Sheth, A., Yip, H.Y., Iyengar, A., & Tepper, P.

IEEE Internet Computing, 2019

2018

How Will the Internet of Things Enable Augmented Personalized Health?

Sheth, A., Jaimini, U., & Yip, H.Y.

IEEE Intelligent Systems, 33(1), Jan-Feb 2018

Conference Papers

2022

Context-Enriched Learning Models for Aligning Biomedical Vocabularies at Scale in the UMLS Metathesaurus

Nguyen, V., Yip, H.Y., Bajaj, G., Wijesiriwardene, T., Javangula, V., Sheth, A., Parthasarathy, S., & Bodenreider, O.

The Web Conference (WWW), 2022

2020

Biomedical Vocabulary Alignment at Scale with the UMLS Metathesaurus

Nguyen, V., Yip, H.Y., & Bodenreider, O.

The Web Conference (WWW), 2020

2020

Siamese KG-LSTM: A Deep Learning Model for Enriching UMLS Metathesaurus Synonymy

Tran, T.T.T., Nghiem, S.V., Le, V.T., Quan, T.T., Nguyen, V., Yip, H.Y., & Bodenreider, O.

12th International Conference on Knowledge and Systems Engineering (KSE), 2020

Workshop Papers

2022

Evaluating Biomedical BERT Models for Vocabulary Alignment at Scale in the UMLS Metathesaurus

Bajaj, G., Nguyen, V., Wijesiriwardene, T., Yip, H.Y., Javangula, V., Parthasarathy, S., Sheth, A., & Bodenreider, O.

Workshop on Insights from Negative Results in NLP, ACL, 2022

2021

Using Contact, Content, and Context in Knowledge-Infused Learning: A Case Study of Non-Sequential Sales Processes

Yip, H.Y., Liu, Y., & Sheth, A.

Knowledge Graph Conference Workshop on Knowledge-Infused Learning, 2021

2019

Construction of UMLS Metathesaurus with Knowledge-Infused Deep Learning

Yip, H.Y., Nguyen, V., & Bodenreider, O.

2nd International Contextualised Knowledge Graph Workshop (CKG), ISWC, 2019

2019

Singleton Property Graph: Adding A Semantic Web Abstraction Layer to Graph Databases

Nguyen, V., Yip, H.Y., Thakkar, H., Li, Q., Bolton, E., & Bodenreider, O.

2nd International Contextualised Knowledge Graph Workshop (CKG), ISWC, 2019

2018

Augmented Personalized Health: Using Semantically Integrated Multimodal Data for Patient Empowered Health Management Strategies

Sheth, A., Yip, H.Y., Jaimini, U., Kadariya, D., Sridharan, V., Venkataramanan, R., Banerjee, T., Thirunarayan, K., & Kalra, M.

mHealth Technology Showcase, NIH, June 2018

2018

kHealth Digital Personalized Healthcare Technology for Pediatric Asthma

Jaimini, U., Yip, H.Y., Venkataramanan, R., Kadariya, D., Sridharan, V., Banerjee, T., Thirunarayan, K., Kalra, M., & Sheth, A.

mHealth Technology Showcase, NIH, June 2018

2018

Feasibility of Recording Sleep Quality and Sleep Duration Using Fitbit in Children with Asthma

Sheth, A., Yip, H.Y., Jaimini, U., Kadariya, D., Sridharan, V., Venkataramanan, R., Banerjee, T., Thirunarayan, K., & Kalra, M.

32nd Annual Meeting of the Associated Professional Sleep Societies (SLEEP), 2018

2018

Correlating Multimodal Signals with Asthma Control in Children Using kHealth System

Kalra, M., Sheth, A., Banerjee, T., Jaimini, U., Kadariya, D., Sridharan, V., Thirunarayan, K., Venkataramanan, R., & Yip, H.Y.

American Thoracic Society, 2018

Patents & Preprints

2024

Robust Useful and General Task-oriented Virtual Assistants

Srivastava, B., Lakkaraju, K., Venkataramanan, R., Pallagani, V., Khandelwal, V., & Yip, H.Y.

U.S. Patent 12,067,983, issued August 20, 2024

2023

RESTORE: Graph Embedding Assessment Through Reconstruction

Yip, H.Y., Ravuru, C., Banerjee, N., Jha, S., Sheth, A., Chadha, A., & Das, A.

arXiv preprint arXiv:2308.14659, 2023

2022

UBert: A Novel Language Model for Synonymy Prediction at Scale in the UMLS Metathesaurus

Wijesiriwardene, T., Nguyen, V., Bajaj, G., Yip, H.Y., Javangula, V., Mao, Y., Fung, K.W., Parthasarathy, S., Sheth, A.P., & Bodenreider, O.

arXiv preprint arXiv:2204.12716, 2022

2019

Electronic Health Record Integration

Yip, H.Y., Taib, N.A., Khan, H.A., & Dhillon, S.K.

Encyclopedia of Bioinformatics and Computational Biology, vol. 2, pp. 1063-1076. Oxford: Elsevier, 2019

Tutorials

2024

Knowledge-driven Processes for Big Data Management and Applications

Yip, H.Y., Wickramarachchi, R., Venkataramanan, R., & Sheth, A.

Tutorial at IEEE BigData, 2024

2024

Data and Knowledge-Driven Processes for the Knowledge Graph Lifecycle

Yip, H.Y. & Sheth, A.

Tutorial at Sixth International Knowledge Graph and Semantic Web Conference (KGSWC), December 2024

Volunteer Services

Program Committee

Reviewer

  • ACL Rolling Review - May 2025
  • PeerJ Computer Science
  • IEEE Access
  • IEEE Transactions
  • IEEE Internet Computing
  • JMIR AI Journal
  • International Conference on Web and Social Media (ICWSM) - 2020
  • Association for Computational Linguistics (ACL) - 2020
  • International Semantic Web Conference (ISWC) - 2018
  • The Web Conference (WWW) - 2018
  • AAAI Conference on Artificial Intelligence - 2018

Invited Speaker

  • Collaborative Assistants for the Society (CASY) - 2020
  • AIISC Summer Camp - 2024, 2025

Technical Skills

Programming Languages

Python Java C++ Perl R JavaScript TypeScript

Semantic Technologies

RDF OWL SPARQL Cypher Ontology

ML/AI Frameworks & Libraries

TensorFlow PyTorch Huggingface Langchain Pandas NumPy Matplotlib Scikit-Learn

Database Management Systems

MySQL MongoDB Neo4j Elasticsearch Redis

Distributed & Scalable Computing

Apache Spark MapReduce Hadoop Slurm HPC

Web Development

Node.js Express React Bootstrap HTML CSS XML SVG