Patent
Web crawler optimization system
Gurudatta Horantur Shivaswamy, Gaurav Kukal, Jaino Joseph, Greeshma Katipally +3 more
- 11 Dec 2013
11
TL;DR: Techniques for optimizing the performance of a web crawler are described, in which the crawler's capacity to fulfill uniform resource locator (URL) crawl requests for an upcoming time period is estimated from historical web crawler performance data.
Abstract: Techniques for optimizing the performance of a webpage crawler are described. According to various embodiments, historical web crawler performance data is accessed, the data describing a performance of a web crawler during various time periods in one or more prior days. A capacity of the web crawler to fulfill uniform resource locator (URL) crawl requests for an upcoming given time period is then estimated, based on the historical web crawler performance data. Thereafter, a plurality of URL crawl requests are distributed to the web crawler during the upcoming given time period, based on the estimated capacity of the web crawler.
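The flow in the abstract (access historical performance data, estimate capacity for an upcoming period, distribute crawl requests accordingly) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how capacity is estimated, so the sketch simply averages the number of requests fulfilled in the same period on prior days, and the names `estimate_capacity`, `distribute`, and the per-day dictionary layout are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: one dict per prior day, mapping a time period
# (here an hour of the day) to the number of crawl requests fulfilled.
DayHistory = Dict[int, int]


def estimate_capacity(history: List[DayHistory], period: int) -> int:
    """Estimate capacity for `period` as the average over prior days.

    Averaging is an assumption of this sketch, not a method stated
    in the abstract. Days with no data for the period are skipped.
    """
    samples = [day[period] for day in history if period in day]
    if not samples:
        return 0
    return sum(samples) // len(samples)


def distribute(requests: List[str], capacity: int) -> Tuple[List[str], List[str]]:
    """Split URL crawl requests into those sent now and those deferred."""
    return requests[:capacity], requests[capacity:]


if __name__ == "__main__":
    history = [{9: 100, 10: 80}, {9: 120, 10: 60}, {9: 110}]
    capacity = estimate_capacity(history, period=9)
    urls = [f"https://example.com/page/{i}" for i in range(150)]
    sent, deferred = distribute(urls, capacity)
    print(capacity, len(sent), len(deferred))
```

Requests beyond the estimated capacity are deferred rather than dropped, so they can be offered again in a later period once a new estimate is available.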
Citations
Patent
Data refining engine for high performance analysis system and method
Satyanarayana Rao Kalikivayi, Mohammed J. Zahoor, Sanjay Parthasarathy +2 more
- 25 Jul 2013
TL;DR: Price and product attributes from webpages are analyzed over time to identify price changes specific to products on individual webpages and for products across all webpages, as well as to identify longitudinal correlations between price changes and attributes.
10
Patent
Optimization method of distributed vertical crawler service system
Yan Feng, Li Guibing, Wei Jichao +2 more
- 20 Jan 2016
TL;DR: An optimization method for a distributed vertical crawler service system is presented, in which the original crawler service system is split into two parts, a download service and page analysis logic, and the task queue is likewise split into a download task queue and an analysis task queue.
10
Patent
Distributed system for large volume deep web data extraction
Jason Crabtree, Andrew Sellers +1 more
- 02 Jan 2017
TL;DR: A distributed system for large volume deep web data extraction that is extremely scalable, allows multiple heterogeneous concurrent searches, has powerful web scrape result processing capabilities, and uses a well-defined, highly customizable, simplified search agent configuration interface requiring minimal specialized programming knowledge.
8
Patent
Monitoring performance of a computer system
Sanjay Sachdev, Easton Alexander, Peng Sean +2 more
- 28 Dec 2017
TL;DR: A technique for monitoring performance of a computer system is presented, in which bucket data is stored that indicates that multiple buckets are associated with a particular type of request. The assignment may be further based on a complexity determined for each request.
6
Patent
Automated Configuration Data Collection for Business Applications Using Feedback
Joel W. Branch, Karin Murthy, Larisa Shwartz, Maja Vukovic +3 more
- 15 Sep 2014
TL;DR: A data collection method that includes collecting, by a configuration collector manager, configuration data, including configuration properties, from a plurality of data sources; creating, by a model discovery component, a business application model using the configuration data collected by the configuration collector manager; collecting, from an application model analysis user interface, edits and confirmations associated with the business application model; and analyzing, by a feedback analyzer component, the edits and confirmations associated with the business application model and prioritizing the configuration properties based on the data sources.
5
References
Patent
Systems and methods for managing resource utilization in information management environments
Roger K. Richter, Chaoxin Qiu, Scott C. Johnson +2 more
- 05 Apr 2002
TL;DR: Resource usage accounting may be implemented in information management environments using resource utilization values, with run-time enforcement of system operations on one or more subsystems or processing engines of an information management system, such as a content delivery system, to provide intelligent admission control in a distributed environment.
672
Patent
A method for adaptive data/content insertion in mpeg2 transport streams
Kavitha Vallari Devara
- 19 Mar 2002
TL;DR: An adaptive data insertion mechanism predicts future available bandwidth by analyzing the recent bandwidth consumed by general programs in the transport stream, and inserts data by replacing selected packets within the transport stream.
75
Patent
Method for controlling a management computer
Akihiko Yamaguchi, Atsushi Hatakeyama +1 more
- 21 Dec 2005
TL;DR: A method for controlling a management computer connected to a server for permitting communications therebetween, wherein the server transmits to a client the result of processing executed in response to each processing request sent from the client.
74
Patent
Queuing system, method and computer program
Matt King
- 13 Nov 2006
TL;DR: A method for managing requests for service over a communications network, comprising the steps of: receiving a request for service from a customer terminal at a queue server via the communications network; allocating a queue identifier to the request for service; sending the queue identifier to the customer terminal; receiving the queue identifier from the customer terminal at the queue server as part of a subsequent request for service; performing a comparison between the queue identifier and queue status information; and forwarding the request to the service host in accordance with the result of the comparison.
26