Jonathan Stern
13 Papers
662 Citations
Jonathan Stern is an academic researcher. The author has contributed to research in topics: Web page & Static web page. The author has an hindex of 7, co-authored 13 publications.
Chat about Author
Papers
Patent
Computer method and apparatus for extracting data from web pages
Michel Decary,Jonathan Stern,Kosmas Karadimitriou,Jeremy W. Rothman-Shore +3 more
- 25 Jul 2001
TL;DR: In this article, a computer method and apparatus for extracting information from a Web page is described, which is formed of an extractor coupled to receive Web pages from a source. But the extractor uses natural language processing to extract desired information from the Web page.
187
Patent
Computer method and apparatus for collecting people and organization information from Web sites
Jonathan Stern,Kosmas Karadimitriou,Jeremy W. Rothman-Shore,Michel Decary +3 more
- 30 Mar 2001
TL;DR: In this article, a Web site of potential interest is accessed and a subset of web pages from the accessed site are determined for processing. But, according to types of contents found on a subject Web page, extraction of people and organization information is enabled.
141
Patent
Method for maintaining people and organization information
Jonathan Stern,Jeremy W. Rothman-Shore,Kosmas Karadimitriou,Michel Decary +3 more
- 27 Jul 2001
TL;DR: In this paper, the authors present a method that provides continual updates to the information stored in the database by the people named by the automated means and by the means of a link from the invention database to a third party data system.
111
Patent
Data mining system
Jonathan Stern,Jeremy W. Rothman-Shore,Kosmas Karadimitriou,Michel Decary +3 more
- 30 Jul 2001
TL;DR: In this paper, a computer automated method and system mines from a global computer network information about people and organizations, including automated crawling means, a distributor controlling the crawling means processing, an extractor storing extracted information of interest in a database, an integrator and post-processor.
103
Patent
Computer method and apparatus for determining content types of web pages
Kosmas Karadimitriou,Jonathan Stern,Michel Decary,Jeremy W. Rothman-Shore +3 more
- 17 Jul 2001
TL;DR: In this paper, a predefined set of potential content types of a subject Web page is first provided, and then a Bayesian network combines the test results to provide indications of the types of contents detected on the subject web page.
52