About
Ningyue Liang is an M.E. candidate in CSE at Harvard University. He graduated from Washington University in St. Louis as a valedictorian with double majors in Computer Science and Statistics. His profound interest in software engineering, machine learning, and AI, coupled with his industrial experience and academic achievements, marks him as an emerging professional equipped to face and solve complex technological challenges.
Internship
Boondoggle AI, Software Engineer Intern
June 2024 – Aug. 2024; San Francisco, CA
Skills: TypeScript, Python, React.JS, LangChain, NLP, LLM, Supabase
Built generative AI solutions to streamline CRM operations, targeting support for 4,000+ clients by automating workflows
Developed automated pipelines with TypeScript, REST APIs, and GPT-4o (LLM) to create CRM contacts & deals by summarizing messages and interactions from LinkedIn and emails, reducing clients’ manual data entry by 70%
Enhanced the accuracy of our semantic search and querying system by integrating LangChain, Pinecone (Vector database), and RAG, improving data processing time by 30% and response quality based on user tests
Created non-blocking unified APIs to automate missing profile completion for major CRMs (Salesforce, HubSpot, etc.)
Impossible Sensing, LLC, Software Engineer Intern
June 2023 – July 2023; St. Louis, MO
Skills: Python, C++, REST API, Django, Flask, React.JS, OOP, PostgreSQL, AWS
Collaborated with a cross-functional team of 5 to develop optical sensing software for NASA exploration detector projects, deploying scalable solutions on AWS using Docker to support over 1,000 users
Developed backend services with the Django MVT framework and REST APIs for record management, handling CRUD operations, data processing, and ML model predictions; implemented real-time camera image collection using OpenCV
Designed generic PostgreSQL schemas and a 2TB AWS RDS-based storage system for data management and replications
Crafted a responsive front-end interface using React.JS and Tailwind CSS, utilizing Dash for 8 interactive data visualizations
Enhanced IPC with async APIs and JSON-RPC for delay generators, improving timing control by 35% at millisecond levels
Sobriety Hub, LLC, Software Engineer Intern
May 2022 – Aug. 2022; St. Louis, MO
Skills: Next.JS, React.JS, REST API, PostgreSQL, OOP, AWS
Developed an operating management application for sober living homes and launched the website with over 500 users
Built Next.JS APIs and Prisma models to speed up event registration by 40% and support operating analysis functions
Created PostgreSQL schemas and built data upload functions with AWS EC2 & S3 for operation and record management
Implemented responsive frontend UI and data visualizations with React, D3.JS, and Chakra-UI, resulting in 30% more visits
Research
One for All: Towards Training One Graph Model for All Classification Tasks
Accepted for ICLR 2024 Spotlight presentation (top 5%)
Paper: https://openreview.net/forum?id=4IT2pgc9v6
Authors: Hao Liu, Jiarui Feng, Lecheng Kong, Ningyue Liang, Dacheng Tao, Yixin Chen, Muhan Zhang
OFA is a general Graph Classification Framework that can solves a wide range of graph classification tasks with a single model and a single set of parameters. The tasks are cross-domain (e.g. citation network, molecular graph,…) and cross-tasks (e.g. few-shot, zero-shot, graph-level, node-leve,…)
OFA use natural languages to describe all graphs, and use a LLM to embed all description in the same embedding space, which enable cross-domain training using a single model.
OFA propose a prompting paradiagm that all task information are converted to prompt graph. So subsequence model is able to read tasks information and predict relavent target accordingly, without having to adjust model parameters and architecture. Hence, a single model can be cross-task.
OFA curated a list of graph datasets from a different sources and domains and describe nodes/edges in the graphs with a systematical decription protocol. We thank previous works including, OGB, GIMLET, MoleculeNet, GraphLLM, and villmow for providing wonderful raw graph/text data that make our work possible.
Advanced Astrophysics Telescope Program at Washington University in St. Louis
Developed with a team of 6 to build a real-time localization system in C++ with object-oriented designs for GRB detections
Implemented reconstruction and localization algorithms with Compton formulas to handle energy data processed by FPGAs
Built an end-to-end ML infrastructure with Python and TensorFlow for preprocessing, training, and serving with 70M data
Optimized the localization pipeline by training and tuning with 40+ versions of regression models (including KNN, ANN, SVM, RF, and XGBoost), and improved its accuracy by 30%, achieving less than 1.5 degrees of error for 10k+ trials
Project
1. Mirror iOS App
(Accepted by App Store, not published) Introducing Mirror, the AI Interviewer App, powered by GPT and designed to help you ace your next interview! With Mirror, you can get a realistic mock practice of real interview questions, and receive real-time feedback from artificial intelligence on your performance. Mirror tracks your performance history, allowing you to see your progress and stay motivated. It’s a comprehensive tool that gives you everything you need to prepare for your interview with confidence.
2. Global Emissions Data Analysis & Interactive Visualizations
Data Analysis and interactive visualizations on 30 years of greenhouse gas emissions for all countries in the world through D3.JS.
3. EDUrain
EDUrain is a platform dedicated to helping every student attend college. We help students file for scholarships for free, and offer a free housing search through our housing engine. Additionally, we’ve built into our platform other services to make the student’s life easier, including renter’s insurance, credit building, and roommate matching! We’re constantly improving our website as we build our startup, so look out for new features that are on the way!
Teaching
I have worked as a teaching assistant for 7 Computer Science Courses and engineering tutor at Washington University in St. Louis:
CSE247 DATA STRUCTURES AND ALGORITHMS
CSE204 WEB DEVELOPMENT
CSE217 INTRODUCTION TO DATA SCIENCE
CSE330S RAPID PROTOTYPE DEVELOPMENT AND CREATIVE PROGRAMMING
CSE332S OBJECT-ORIENTED SOFTWARE DEVELOPMENT LABORATORY
CSE361S INTRODUCTION TO SYSTEMS SOFTWARE
CSE438S MOBILE APPLICATION DEVELOPMENT
Tech Stack
| | |
| | |
| | |
| | |
| | |