Experience

Sia Precision Education – Director of Recruitment

August 2021 – Current

  • Leverage technical knowledge to interview potential employees for various roles within the organization, including software engineers and data scientists to develop software customized for incarcerated students
  • Create various internship programs for undergraduate and graduate students
  • Design and supervise all recruitment initiatives (i.e. LinkedIn, university outreach, etc.)

Research Intern – Summers Lab, NIH Clinical Center (Supervised by Dr. Ronald Summers)

June 2021 – August 2021

  • Designed and implemented adversarial ResNet-18 architectures where the discriminator is trained to distinguish post-contrast images from synthetic scans meant to mimic specific tissue structures
  • Successfully generated realistic synthetic data that was inseparable from real post-contrast images
  • Implemented a pipeline for data preprocessing and synthetic data generation that will be used for dataset augmentation prior to pre-training large neural networks (e.g., vision transformer) for downstream tasks

NLP Research Assistant – Case Western Reserve University (Supervised by Yanfang Ye and Long Phan)

January 2021 – August 2021

  • Developed CoTexT, a transformer model for multi-task learning on natural language and programming language data that achieved state-of-the-art results on the Microsoft xGLUE benchmark datasets
  • Designed multiple learning objectives for self-supervised learning on both unimodal (only PL) and bimodal inputs (NL and PL)
  • Modified encoder-only models (e.g., PhoBERT) to behave as a decoder block by changing self-attention layers from bidirectional into left-context-only and randomly initializing a cross-attention mechanism
  • Developed the PhoBERT2PhoBERT and ViBERT2ViBERT encoder-decoder models and achieved state- of-the-art performances on summarization tasks (VietNews dataset)
  • Developed SciFive, a transformer model for generative tasks in the biomedical domain (e.g., question- answering, summarization)

Volunteer Research Assistant – National Cancer Institute (Supervised by Dr. Grégoire Altan-Bonnet)

 December 2020 – June 2021

  • Performed data collection and preprocessing in order to pre-train a transformer model on approximately 1-billion single-cells with different sets of features
  • Assisted with design, testing, and validation of HAL-x, an extremely scalable hierarchical clustering algorithm that trains supervised classifiers to learn the hierarchy on raw, unprocessed data
  • Achieved state-of-the art runtimes on datasets with >10 million data points (i.e., mass cytometry datasets)

Science Team Intern – Predictuv Technologies Inc., Austin, Texas (Supervised by D. Christopher Keil)

May 2019 – December 2019

  • Assisted with mass data collection and data quality assurance to prepare training data for machine learning algorithms applied to image-driven sentiment analysis used in casino software
  • Developed an LSTM-CNN for predicting emotions from facial movements in longitudinal imaging data
  • Developed a detect-and-track program that predicts human action related to various sports (e.g., wide receivers running routes) to provide athletes with real-time feedback on technique/performance