Tao Lin

Tao Lin

Greater Boston
2K followers 500+ connections

About

AI Infra, Distributed Systems

Articles by Tao

Activity

Join now to see all activity

Experience

  • Meta Graphic
  • -

    Cupertino, California, United States

  • -

    Cupertino, California, United States

  • -

    Greater Pittsburgh Area

  • -

    Mountain View, CA

  • -

  • -

    Hong Kong SAR

Education

  • Carnegie Mellon University Graphic

    Carnegie Mellon University

    4.05/4.33

    -

    Master of Computational Data Science (MCDS)

    - System courses: Advanced Database Systems, Advanced Cloud Computing, Parallel Computer Architecture and Programming
    - ML/NLP courses: Machine Learning, ML with Large Datasets, Deep Learning, ML for Text Mining, Search Engines, Natural Language Processing

  • -

    -

  • -

    -

Publications

  • FoundationDB Record Layer: A Multi-Tenant Structured Datastore

    2019 International Conference on Management of Data (SIGMOD ’19)

    The FoundationDB Record Layer is an open source library that provides a record-oriented data store with semantics similar to a relational database implemented on top of FoundationDB, an ordered, transactional key-value store. The Record Layer provides a lightweight, highly extensible way to store structured data. It offers schema management and a rich set of query and indexing facilities, some of which are not usually found in traditional relational databases, such as nested record types…

    The FoundationDB Record Layer is an open source library that provides a record-oriented data store with semantics similar to a relational database implemented on top of FoundationDB, an ordered, transactional key-value store. The Record Layer provides a lightweight, highly extensible way to store structured data. It offers schema management and a rich set of query and indexing facilities, some of which are not usually found in traditional relational databases, such as nested record types, indexes on commit versions, and indexes that span multiple record types. The Record Layer is stateless and built for massive multi-tenancy, encapsulating and isolating all of a tenant’s state, including indexes, into a separate logical database. We demonstrate how the Record Layer is used by CloudKit, Apple’s cloud backend service, to provide powerful abstractions to applications serving hundreds of millions of users. CloudKit uses the Record Layer to host billions of independent databases, many with a common schema. Features provided by the Record Layer enable CloudKit to provide richer APIs and stronger semantics with reduced maintenance overhead and improved scalability.

    Other authors
    See publication
  • Rubystar: A non-task-oriented mixture model dialog system

    1st Proceedings of Alexa Prize (Alexa Prize 2017)

    RubyStar is a dialog system designed to create “human-like” conversation by combining different response generation strategies. RubyStar conducts a non- task-oriented conversation on general topics by using an ensemble of rule-based, retrieval-based and generative methods. Topic detection, engagement monitoring, and context tracking are used for managing interaction. Predictable elements of conversation, such as the bot’s backstory and simple question answering are handled by separate modules…

    RubyStar is a dialog system designed to create “human-like” conversation by combining different response generation strategies. RubyStar conducts a non- task-oriented conversation on general topics by using an ensemble of rule-based, retrieval-based and generative methods. Topic detection, engagement monitoring, and context tracking are used for managing interaction. Predictable elements of conversation, such as the bot’s backstory and simple question answering are handled by separate modules. We describe a rating scheme we developed for evaluating response generation. We find that character-level RNN is an effective generation model for general responses, with proper parameter settings; however other kinds of conversation topics might benefit from using other models.

    Other authors
    See publication
  • TieVis: Visual Analytics of Evolution of Interpersonal Ties

    10th International Conference on E-Learning and Games (Edutainment 2016)

  • Mobility Viewer: A Eulerian Approach for Studying Urban Crowd Flow

    IEEE Transactions on Intelligent Transportation Systems

Honors & Awards

  • National Scholarship

    Ministry of Education, China

  • Google Excellence Scholarship

    Google Inc.

Languages

  • Chinese

    Native or bilingual proficiency

  • English

    Full professional proficiency

More activity by Tao

View Tao’s full profile

  • See who you know in common
  • Get introduced
  • Contact Tao directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses