Pei Yue
Microsoft
3 Papers
28 Citations
Pei Yue is an academic researcher from Microsoft. The author has contributed to research in topics: Blocking (statistics) & Block (data storage). The author has an hindex of 3, co-authored 3 publications.
Chat about Author
Papers
Patent
Using core words to extract key phrases from documents
TL;DR: In this paper, a document's key phrases are extracted from a document based upon core words in that document (e.g., the words most relevant to the document) and various relevance features of each candidate word may be used to score and rank the candidate words relative to one another and thereby determine the core word or core words.
16
Patent
Record linkage based on a trained blocking scheme
Yunbo Cao,Chin-Yew Lin,Pei Yue,Zhiyuan Chen +3 more
- 13 Feb 2012
TL;DR: In this article, the authors provide techniques and arrangements to train a blocking scheme using both labeled data and unlabeled data, which can be used by a search engine when searching for records that match an entity.
12
Leveraging unlabeled data to scale blocking for record linkage
Yunbo Cao,Zhiyuan Chen,Jiamin Zhu,Pei Yue,Chin-Yew Lin,Yong Yu +5 more
- 16 Jul 2011
TL;DR: The experimental results show that using unlabeled data in learning can remarkably reduce the number of candidate matches while keeping the same level of coverage for true matches.