About
AI Infra, Distributed Systems
Articles by Tao
Activity
-
🚀 Excited to share our work on In-Kernel Broadcast Optimization, a significant leap in Meta’s large-scale RecSys inference! In-Kernel Broadcast…
Liked by Tao Lin
-
Most teams today are renting intelligence. Today, RadixArk is launching with $100M in seed funding to let enterprises and AI builders own it instead.…
Liked by Tao Lin
-
Congrats RadixArk! From SGLang to Miles, and to future products, RadixArk is dedicated to building a crucible capable of repeatedly producing…
Liked by Tao Lin
Experience
Education
-
Carnegie Mellon University
4.05/4.33
-
Master of Computational Data Science (MCDS)
- System courses: Advanced Database Systems, Advanced Cloud Computing, Parallel Computer Architecture and Programming
- ML/NLP courses: Machine Learning, ML with Large Datasets, Deep Learning, ML for Text Mining, Search Engines, Natural Language Processing
Publications
-
FoundationDB Record Layer: A Multi-Tenant Structured Datastore
2019 International Conference on Management of Data (SIGMOD ’19)
The FoundationDB Record Layer is an open source library that provides a record-oriented data store with semantics similar to a relational database implemented on top of FoundationDB, an ordered, transactional key-value store. The Record Layer provides a lightweight, highly extensible way to store structured data. It offers schema management and a rich set of query and indexing facilities, some of which are not usually found in traditional relational databases, such as nested record types, indexes on commit versions, and indexes that span multiple record types. The Record Layer is stateless and built for massive multi-tenancy, encapsulating and isolating all of a tenant’s state, including indexes, into a separate logical database. We demonstrate how the Record Layer is used by CloudKit, Apple’s cloud backend service, to provide powerful abstractions to applications serving hundreds of millions of users. CloudKit uses the Record Layer to host billions of independent databases, many with a common schema. Features provided by the Record Layer enable CloudKit to provide richer APIs and stronger semantics with reduced maintenance overhead and improved scalability.
-
RubyStar: A non-task-oriented mixture model dialog system
1st Proceedings of Alexa Prize (Alexa Prize 2017)
RubyStar is a dialog system designed to create “human-like” conversation by combining different response generation strategies. RubyStar conducts a non-task-oriented conversation on general topics by using an ensemble of rule-based, retrieval-based and generative methods. Topic detection, engagement monitoring, and context tracking are used for managing interaction. Predictable elements of conversation, such as the bot’s backstory and simple question answering are handled by separate modules. We describe a rating scheme we developed for evaluating response generation. We find that character-level RNN is an effective generation model for general responses, with proper parameter settings; however other kinds of conversation topics might benefit from using other models.
-
TieVis: Visual Analytics of Evolution of Interpersonal Ties
10th International Conference on E-Learning and Games (Edutainment 2016)
-
Mobility Viewer: An Eulerian Approach for Studying Urban Crowd Flow
IEEE Transactions on Intelligent Transportation Systems
Honors & Awards
-
National Scholarship
Ministry of Education, China
-
Google Excellence Scholarship
Google Inc.
Languages
-
Chinese
Native or bilingual proficiency
-
English
Full professional proficiency
More activity by Tao
-
I'm hiring a Software Engineer to join my team building backend services for Apple Games in Seattle! If you are interested or know someone that might…
Liked by Tao Lin
-
Last week was my last at Meta. A little over a year ago, I left Google to join PyTorch and move to New York. It was scary to leave something good…
Liked by Tao Lin
-
Today we're releasing Muse Spark, the first model from MSL. Nine months ago we rebuilt our AI stack from scratch. New infrastructure, new…
Liked by Tao Lin
-
Just wrapped up my first week as a Software Engineer at Sierra, and it’s been an absolute blast! 🚀 #NewJob
Liked by Tao Lin