Ruichen Bao
Master of Engineering in Computer Science @ Cornell University
Education
Cornell University
Aug 2025 – May 2026 (Expected)Master of Engineering in Computer Science (GPA: 4.129/4.0)
Zhejiang University
Sep 2019 – Jul 2024Bachelor of Engineering in Information Engineering (GPA: 3.9/4.0)
Experience
Alipay
May 2025 – Aug 2025Software Engineer Intern | Java
- Optimized location-based search ranking pipeline. Introduced full-text match strategy, boosting search result relevance and CTR by 15%.
- Enhanced tag retrieval system by integrating vector-based similarity search, capturing semantically related tags and increasing discovery rates by 30%.
- Engineered a crowd export data preview feature, mitigating risks with robust data validation and instant rollback mechanisms.
Zilliz
Nov 2024 – May 2025Software Engineer Intern | C++, Python
- Engineered an open-source AI code generation helper MCP tool with SSE streaming; designed a RAG pipeline with vector database to inject Pymilvus context, reducing LLM hallucinations. [github]
- Added asynchronous support for the open-source Pymilvus SDK utilizing asyncio, accelerating client-side throughput by 10x+. [github]
- Extended Milvus C++ SDK by implementing a gRPC interceptor and enhanced cross-platform accessibility with RESTful API support via cpp-httplib. [github]
- Contributed to GenAI open-source ecosystem by integrating FastAPI and Ollama into DeepSearcher (7.7k+ stars) backend, and automated Ruff workflows in CI/CD. [github]
Selected Projects
E-commerce Platform
Aug 2024 – Nov 2024Java, SpringBoot, Redis
- Developed a full-featured e-commerce platform with single sign-on (SSO) for user authentication.
- Ensured transactional integrity during high-traffic sales by implementing optimistic locking and a Redis-based distributed lock.
- Increased peak throughput by architecting an asynchronous processing system, utilizing Redis caching and a message queue (MQ) to boost system concurrency.
MiniSQL Database Engine
Aug 2022 – Nov 2022C++
- Developed a lightweight relational database engine from scratch, supporting standard SQL DDL/DML operations.
- Accelerated data retrieval by implementing a buffer pool manager with an LRU replacement policy.
- Improved query performance for large datasets by integrating an index manager based on a B+ Tree, enabling logarithmic-time lookups.
Skills
Languages: C++, Python, Java, Swift, C, SQL, Bash, JavaScript, HTML/CSS
Frameworks: Spring Boot, gRPC, PyTorch, TensorFlow, vLLM, CUDA, FastAPI, Django, SwiftUI, ROS
Infrastructure: Linux, Docker, MySQL, Redis, RabbitMQ, Elasticsearch, MinIO, Milvus