About
database, data warehouse, data mining and statistics, cloud computing, big data.
Articles by Zheng
Activity
-
All models are wrong, but why and when can wrong models produce the right predictions? In Chapter 3 of Charting Reality, we look at how mechanistic…
All models are wrong, but why and when can wrong models produce the right predictions? In Chapter 3 of Charting Reality, we look at how mechanistic…
Liked by Zheng Shao
Experience
Education
Licenses & Certifications
Patents
-
Systems and methods of predicting resource usefulness using universal resource locators including counting the number of times URL features occur in training data
Issued US 7908234
A method, system and apparatus are provided to train a usefulness prediction model to generate a usefulness prediction in connection with a given universal resource locator (URL), the training of the usefulness prediction model being based on a training set of URLs and a count of negative URLs and a count of positive URLs identified by the training set, and for each feature extacted from the URLs in the training set, a count of the positive URLs in the training set that include the feature and…
A method, system and apparatus are provided to train a usefulness prediction model to generate a usefulness prediction in connection with a given universal resource locator (URL), the training of the usefulness prediction model being based on a training set of URLs and a count of negative URLs and a count of positive URLs identified by the training set, and for each feature extacted from the URLs in the training set, a count of the positive URLs in the training set that include the feature and a count of the negative URLs in the training set that include the feature. One or more features of the given URL are extracted, and the extracted features are used together with the usefulness prediction model to generate a usefulness prediction for the given URL.
Other inventorsSee patent
Honors & Awards
-
ICDE 2020 Ten-Year Influential Paper Award
IEEE
Hive - A Petabyte Scale Data Warehouse Using Hadoop
Reference: http://tab.computer.org/tcde/icde_inf_paper.html -
TopCoder Open 2005 Onsite Finalist
TopCoder
Handle: haha
Reference: https://tco05.topcoder.com/tracks/algorithm
-
Competitive Programming Hall of Fame
-
https://cphof.org/profile/ioi:2617
-
TopCoder Open 2004 Onsite Finalist
TopCoder
Handle: haha
Reference: https://tco04.topcoder.com/tracks/algorithm
-
Outstanding Graduate (Top 2%)
Tsinghua University, Compute Science and Technology Department
Awarded to the top 3 graduates out of 160 in the department in 2003.
-
11th Place in ACM ICPC World Finals 2001
ACM ICPC
Team: Hurricane
Representing: Tsinghua University
Reference: https://icpc.global/community/results-2001 -
Champion, CUMCM (China Undergraduate Mathematical Contest in Modeling)
China Undergraduate Mathematical Contest in Modeling
Lead member of the 3-student team that won the first place among all college undergraduates in China.
Reference (in Chinese): https://wenku.baidu.com/view/34998f4669eae009581bec93.html
-
Gold Medal - IOI 1999 (International Olympiad in Informatics)
-
Global Rank: The 9th place
Reference: http://stats.ioinformatics.org/results/1999
Recommendations received
5 people have recommended Zheng
Join now to viewOther similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content