Professional Experience

DRW

Software Developer | July 2022 - Present
Chicago, Illinois, United States · Hybrid

Generative AI

  • Built AI/ML/NLP systems for language model research, evaluation, search, retrieval, and deployment
  • Managed infrastructure for scaling ML workloads (data, inference) to multiple nodes and GPUs
  • Implemented retrieval augmented generation (RAG) pipelines, vector store indexing, agents
  • Built distributed data pipelines (Spark/Ray, ETLs), debugging tools for production services
  • Decreased model fine-tune/inference time and memory by 20% with tensor parallelism
  • Technologies: Python, PyTorch, Tensorflow, Ray, DeepSpeed, LangChain, MLFlow, Numpy, Spark, Pandas, Flask, PostgreSQL, Kubernetes, Javascript, GRPC/protobuf, Kafka, CI/CD

Automated Trading, Risk Analytics

  • Developed trading-critical systems for financial risk analytics, automated trading, price/size calculation
  • Managed microservices for distributed data processing (Kafka, Python GRPC APIs, ETLs, Kubernetes)
  • Worked on full product lifecycle from research (Jupyter notebooks) to productionizing (K8s), testing, and support
  • Developed tools for debugging/tracing system state, data flow, visualization, simulation, and scenario analysis
  • Implemented numerical/statistical models for option risk surface dynamics, PCA/matrix decomp, regressions
  • Technologies: Python, Pandas, Numpy/Scipy, Flask, Kubernetes, Pyarrow, GRPC, Redis, distributed systems

Meta

Software Engineer Intern | May 2022 - July 2022
Menlo Park, California, United States · On-site

  • Worked on the Language Understanding and Question Answering (LUQA) team, under Reality Labs AI
  • Optimized neural language models (LMs) for long-form question answering (QA) and summarization tasks
  • Fine-tuned and benchmarked retriever/reader models (BERT, RoBERTa, DPR) on meeting/conversation datasets
  • Researched model quantization, distillation, pruning, dense retrieval, attention models
  • Developed web apps to collect and crowdsource datasets on evaluating model generated output, demo capabilities
  • Technologies: Python, React, PyTorch, BERT, RoBERTa, DPR

University of Maryland

Research Assistant | June 2019 - May 2022
College Park, Maryland

  • Wrote and published papers in NLP conferences (EMNLP, EACL) on the topics of question answering, human-AI collaboration and topic models
  • Developed web app to accelerate coding (annotation) of technical documents, using active learning and NLP
  • Achieved theoretical speed up of 4x with comparable accuracy using topic models and text classification models
  • Applied computer vision techniques to multiple object detection, pose estimation and action recognition
  • Technologies: Python, PyTorch, NLP, Computer Vision, Web Development