James Fan

James Fan

New York, New York, United States
1K followers 500+ connections

About

Proven track record of developing, applying & scaling cutting edge AI technology to…

Articles by James

  • Customer Service Holiday Blues

    I know we just had Labor Day, but as all retailers in US know, it’s time to get ready for the busiest of time of the…

Activity

1K followers

See all activities

Experience

  • Tomato.ai (Sold to Sanas) Graphic

    Tomato.ai (Sold to Sanas)

    New York City Metropolitan Area

  • -

    Greater New York City Area

  • -

    Greater New York City Area

  • -

  • -

    Greater New York City Area

  • -

    Greater New York City Area

  • -

  • -

  • -

Education

Publications

  • Building Joint Spaces for Relation Extraction

    IJCAI 2016

    In this paper, we present a novel approach for relation extraction using only term
    pairs as the input without textual features. We aim to build a single joint space for each
    relation which is then used to produce relation specific term embeddings. The proposed method fits particularly well for domains in which similar arguments are often associated with similar relations. It can also handle the situation when the labeled data is limited. The proposed method is evaluated both theoretically…

    In this paper, we present a novel approach for relation extraction using only term
    pairs as the input without textual features. We aim to build a single joint space for each
    relation which is then used to produce relation specific term embeddings. The proposed method fits particularly well for domains in which similar arguments are often associated with similar relations. It can also handle the situation when the labeled data is limited. The proposed method is evaluated both theoretically with a proof for the closed-form solution and experimentally with promising results on both DBpedia and medical relations.

    Other authors
    See publication
  • Poker-cnn: A pattern learning strategy for making draws and bets in poker games using convolutional networks

    AAAI-2016

    Poker is a family of card games that includes many variations. We hypothesize that most poker games can be solved as a pattern matching problem, and propose creating a strong poker playing system based on a unified poker representation. Our poker player learns through iterative self-play, and improves its understanding of the game by training on the results of its previous actions without sophisticated domain knowledge. We evaluate our system on three poker games:…

    Poker is a family of card games that includes many variations. We hypothesize that most poker games can be solved as a pattern matching problem, and propose creating a strong poker playing system based on a unified poker representation. Our poker player learns through iterative self-play, and improves its understanding of the game by training on the results of its previous actions without sophisticated domain knowledge. We evaluate our system on three poker games: single player video poker, two-player Limit Texas Hold’em, and finally two-player 2-7 triple draw poker. We show that our model can quickly learn patterns in these very different poker games while it improves from zero knowledge to a competitive player against human experts. The contributions of this paper include: (1) a novel representation for poker games, extendable to different poker variations, (2) a Convolutional Neural Network (CNN) based learning model that can effectively learn the patterns in three different games, and (3) a self-trained system that significantly beats the heuristic-based program on which it is trained, and our system is competitive against human expert players.

    Other authors
    See publication
  • Medical Relation Extraction with Manifold Models

    ACL

    In this paper, we present a manifold model for medical relation extraction. Our model
    is built upon a medical corpus containing 80M sentences (11 gigabyte text) and designed
    to accurately and efficiently detect the key medical relations that can facilitate
    clinical decision making. Our approach integrates domain specific parsing and typing
    systems, and can utilize labeled as well as unlabeled examples. To provide users
    with more flexibility, we also take label weight into…

    In this paper, we present a manifold model for medical relation extraction. Our model
    is built upon a medical corpus containing 80M sentences (11 gigabyte text) and designed
    to accurately and efficiently detect the key medical relations that can facilitate
    clinical decision making. Our approach integrates domain specific parsing and typing
    systems, and can utilize labeled as well as unlabeled examples. To provide users
    with more flexibility, we also take label weight into consideration. Effectiveness
    of our model is demonstrated both theoretically with a proof to show that the solution
    is a closed-form solution and experimentally with positive results in experiments.

    Other authors
    • Chang Wang
    See publication
  • Advances in automated knowledge base construction

    SIGMOD Records Journal

    Recent years have seen significant advances on the creation of large-scale knowledge bases (KBs). Extracting knowledge from Web pages, and integrating it into a coherent KB is a task that spans the areas of natural language processing, information extraction, information integration, databases, search and machine learning. Some of the latest developments in the field were presented at the AKBC-WEKEX workshop on knowledge extraction at the NAACL-HLC 2012 conference. This workshop included 23…

    Recent years have seen significant advances on the creation of large-scale knowledge bases (KBs). Extracting knowledge from Web pages, and integrating it into a coherent KB is a task that spans the areas of natural language processing, information extraction, information integration, databases, search and machine learning. Some of the latest developments in the field were presented at the AKBC-WEKEX workshop on knowledge extraction at the NAACL-HLC 2012 conference. This workshop included 23 accepted papers, and 11 keynotes by senior researchers. The workshop had speakers from all major search engine providers, government institutions, and the leading universities in the field. In this survey, we summarize the papers, the keynotes, and the
    discussions at this workshop.

    Other authors
    See publication
  • Analysis of watson's strategies for playing Jeopardy!

    Journal of Artificial Intelligence Research

    Major advances in Question Answering technology were needed for IBM Watson1
    to play Jeopardy!2 at championship level – the show requires rapid-fire answers to challenging
    natural language questions, broad general knowledge, high precision, and accurate confi-
    dence estimates. In addition, Jeopardy! features four types of decision making carrying
    great strategic importance: (1) Daily Double wagering; (2) Final Jeopardy wagering; (3)
    selecting the next square when in control of the…

    Major advances in Question Answering technology were needed for IBM Watson1
    to play Jeopardy!2 at championship level – the show requires rapid-fire answers to challenging
    natural language questions, broad general knowledge, high precision, and accurate confi-
    dence estimates. In addition, Jeopardy! features four types of decision making carrying
    great strategic importance: (1) Daily Double wagering; (2) Final Jeopardy wagering; (3)
    selecting the next square when in control of the board; (4) deciding whether to attempt
    to answer, i.e., “buzz in.” Using sophisticated strategies for these decisions, that properly
    account for the game state and future event probabilities, can significantly boost a player’s
    overall chances to win, when compared with simple “rule of thumb” strategies.
    This article presents our approach to developing Watson’s game-playing strategies,
    comprising development of a faithful simulation model, and then using learning and MonteCarlo
    methods within the simulator to optimize Watson’s strategic decision-making. After
    giving a detailed description of each of our game-strategy algorithms, we then focus in
    particular on validating the accuracy of the simulator’s predictions, and documenting performance
    improvements using our methods. Quantitative performance benefits are shown
    with respect to both simple heuristic strategies, and actual human contestant performance
    in historical episodes. We further extend our analysis of human play to derive a number of
    valuable and counterintuitive examples illustrating how human contestants may improve
    their performance on the show.

    Other authors
    • Gerald Tesauro
    • David C Gondek
    • Jonathan Lenchner
    • John M Prager
    See publication
  • A comparison of hard filters and soft evidence for answer typing in watson

    International Semantic Web Conference

    Questions often explicitly request a particular type of answer. One
    popular approach to answering natural language questions involves filtering candidate
    answers based on precompiled lists of instances of common answer types
    (e.g., countries, animals, foods, etc.). Such a strategy is poorly suited to an open
    domain in which there is an extremely broad range of types of answers, and the
    most frequently occurring types cover only a small fraction of all answers. In
    this paper we…

    Questions often explicitly request a particular type of answer. One
    popular approach to answering natural language questions involves filtering candidate
    answers based on precompiled lists of instances of common answer types
    (e.g., countries, animals, foods, etc.). Such a strategy is poorly suited to an open
    domain in which there is an extremely broad range of types of answers, and the
    most frequently occurring types cover only a small fraction of all answers. In
    this paper we present an alternative approach called TyCor, that employs soft filtering
    of candidates using multiple strategies and sources. We find that TyCor
    significantly outperforms a single-source, single-strategy hard filtering approach,
    demonstrating both that multi-source multi-strategy outperforms a single source,
    single strategy, and that its fault tolerance yields significantly better performance
    than a hard filter.

    Other authors
    See publication
  • Automatic knowledge extraction from documents

    IBM Journal of Research and Development

    Access to a large amount of knowledge is critical for success at
    answering open-domain questions for DeepQA systems such as
    IBM Watsoni. Formal representation of knowledge has the
    advantage of being easy to reason with, but acquisition of structured
    knowledge in open domains from unstructured data is often difficult
    and expensive. Our central hypothesis is that shallow syntactic
    knowledge and its implied semantics can be easily acquired and can
    be used in many areas of a…

    Access to a large amount of knowledge is critical for success at
    answering open-domain questions for DeepQA systems such as
    IBM Watsoni. Formal representation of knowledge has the
    advantage of being easy to reason with, but acquisition of structured
    knowledge in open domains from unstructured data is often difficult
    and expensive. Our central hypothesis is that shallow syntactic
    knowledge and its implied semantics can be easily acquired and can
    be used in many areas of a question-answering system. We take a
    two-stage approach to extract the syntactic knowledge and implied
    semantics. First, shallow knowledge from large collections of
    documents is automatically extracted. Second, additional
    semantics are inferred from aggregate statistics of the
    automatically extracted shallow knowledge. In this paper, we
    describe in detail what kind of shallow knowledge is extracted,
    how it is automatically done from a large corpus, and how
    additional semantics are inferred from aggregate statistics. We
    also briefly discuss the various ways extracted knowledge is used
    throughout the IBM DeepQA system.

    Other authors
    See publication
  • Finding needles in the haystack: Search and candidate generation

    IBM Journal of Research and Development

    A key phase in the DeepQA architecture is Hypothesis Generation, in
    which candidate system responses are generated for downstream
    scoring and ranking. In the IBM Watsoni system, these hypotheses
    are potential answers to Jeopardy!i questions and are generated by
    two components: search and candidate generation. The search
    component retrieves content relevant to a given question from
    Watson’s knowledge resources. The candidate generation component
    identifies potential answers…

    A key phase in the DeepQA architecture is Hypothesis Generation, in
    which candidate system responses are generated for downstream
    scoring and ranking. In the IBM Watsoni system, these hypotheses
    are potential answers to Jeopardy!i questions and are generated by
    two components: search and candidate generation. The search
    component retrieves content relevant to a given question from
    Watson’s knowledge resources. The candidate generation component
    identifies potential answers to the question from the retrieved
    content. In this paper, we present strategies developed to use
    characteristics of Watson’s different knowledge sources and to
    formulate effective search queries against those sources. We further
    discuss a suite of candidate generation strategies that use various
    kinds of metadata, such as document titles or anchor texts in
    hyperlinked documents. We demonstrate that a combination of these
    strategies brings the correct answer into the candidate answer
    pool for 87.17% of all the questions in a blind test set, facilitating
    high end-to-end question-answering performance.

    Other authors
    See publication
  • Learning to rank for robust question answering

    Proceedings of the 21st ACM international conference on Information and knowledge management

    This paper aims to solve the problem of improving the ranking of
    answer candidates for factoid based questions in a state-of-the-art
    Question Answering system. We first provide an extensive comparison
    of 5 ranking algorithms on two datasets – from the Jeopardy
    quiz show and a medical domain. We then show the effectiveness
    of a cascading approach, where the ranking produced by one
    ranker is used as input to the next stage. The cascading approach
    shows sizeable gains on both…

    This paper aims to solve the problem of improving the ranking of
    answer candidates for factoid based questions in a state-of-the-art
    Question Answering system. We first provide an extensive comparison
    of 5 ranking algorithms on two datasets – from the Jeopardy
    quiz show and a medical domain. We then show the effectiveness
    of a cascading approach, where the ranking produced by one
    ranker is used as input to the next stage. The cascading approach
    shows sizeable gains on both datasets. We finally evaluate several
    rank aggregation techniques to combine these algorithms, and find
    that Supervised Kemeny aggregation is a robust technique that always
    beats the baseline ranking approach used by Watson for the
    Jeopardy competition. We further corroborate our results on TREC
    Question Answering datasets.

    Other authors
    See publication
  • Textual resource acquisition and engineering

    IBM Journal of Research and Development

    A key requirement for high-performing question-answering (QA)
    systems is access to high-quality reference corpora from which
    answers to questions can be hypothesized and evaluated. However,
    the topic of source acquisition and engineering has received very
    little attention so far. This is because most existing systems were
    developed under organized evaluation efforts that included reference
    corpora as part of the task specification. The task of answering
    Jeopardy!i…

    A key requirement for high-performing question-answering (QA)
    systems is access to high-quality reference corpora from which
    answers to questions can be hypothesized and evaluated. However,
    the topic of source acquisition and engineering has received very
    little attention so far. This is because most existing systems were
    developed under organized evaluation efforts that included reference
    corpora as part of the task specification. The task of answering
    Jeopardy!i questions, on the other hand, does not come with such a
    well-circumscribed set of relevant resources. Therefore, it became
    part of the IBM Watsoni effort to develop a set of well-defined
    procedures to acquire high-quality resources that can effectively
    support a high-performing QA system. To this end, we developed
    three procedures, i.e., source acquisition, source transformation, and
    source expansion. Source acquisition is an iterative development
    process of acquiring new collections to cover salient topics deemed
    to be gaps in existing resources based on principled error analysis.
    Source transformation refers to the process in which information
    is extracted from existing sources, either as a whole or in part, and is
    represented in a form that the system can most easily use. Finally,
    source expansion attempts to increase the coverage in the content of
    each known topic by adding new information as well as lexical
    and syntactic variations of existing information extracted from
    external large collections. In this paper, we discuss the methodology
    that we developed for IBM Watson for performing acquisition,
    transformation, and expansion of textual resources. We demonstrate
    the effectiveness of each technique through its impact on candidate
    recall and on end-to-end QA performance.

    Other authors
    See publication
Join now to see all publications

Projects

Languages

  • Chinese

    Native or bilingual proficiency

  • English

    Native or bilingual proficiency

View James’ full profile

  • See who you know in common
  • Get introduced
  • Contact James directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Others named James Fan in United States

Add new skills with these courses