Patent
Web crawling method and server
Cheng Zhi Feng, Qiu Bai Yu +1 more
- 20 Jun 2019
2
TL;DR: This patent presents a web page grabbing method in which target web pages on a website, comprising pages with Hypertext Markup Language 5 (H5) content and pages with non-H5 content, are grabbed, and the H5 pages are detected from the web page source code.
Abstract: A web page grabbing method is provided. A target web page on a website is grabbed, the target web page including a web page corresponding to a Hypertext Markup Language 5 (H5) content and a web page corresponding to a non-H5 content. The web page corresponding to the H5 content is detected according to web page source code of the target web page. Dynamic rendering is performed on the web page corresponding to the H5 content, to obtain a rendered web page. Content details information corresponding to the H5 content is extracted from the rendered web page.
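The steps in the abstract (detect H5 pages from source code, render them dynamically, extract content details) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say which source-code features identify H5 content, so the `H5_MARKERS` pattern, the function names, and the pluggable `render` callable (which stands in for a real headless-browser renderer) are all hypothetical.

```python
import re
from html.parser import HTMLParser
from typing import Callable, Optional

# ASSUMPTION: the abstract does not define how H5 content is recognised.
# Here a page counts as H5 if its source uses typical HTML5 media tags.
H5_MARKERS = re.compile(r"<\s*(canvas|video|audio)\b", re.IGNORECASE)


def is_h5_page(source: str) -> bool:
    """Detect an H5 page from its source code (hypothetical heuristic)."""
    return bool(H5_MARKERS.search(source))


class _DetailsParser(HTMLParser):
    """Collects the page title and visible text, skipping script/style."""

    def __init__(self) -> None:
        super().__init__()
        self.title = ""
        self.text = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        data = data.strip()
        if not data:
            return
        if self._in_title:
            self.title = data
        elif not self._skip_depth:
            self.text.append(data)


def extract_details(rendered_html: str) -> dict:
    """Extract content details (title and visible text) from rendered HTML."""
    parser = _DetailsParser()
    parser.feed(rendered_html)
    parser.close()
    return {"title": parser.title, "text": " ".join(parser.text)}


def process_page(source: str, render: Callable[[str], str]) -> Optional[dict]:
    """Render and extract H5 pages; non-H5 pages are left to other handling."""
    if not is_h5_page(source):
        return None
    return extract_details(render(source))
```

In practice `render` would drive a headless browser and return the DOM after scripts have run; keeping it as a parameter lets the detection and extraction steps be exercised without one.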
Citations
Patent
Self-adaptive web crawling and text extraction
Huang Chen-Yu, Lee Sheng-Wei, Lin June-Ray, Wu Ci-Hao, Yang Hsieh-Lung, Yu Ying-Chen +5 more
- 03 Oct 2019
TL;DR: A method, computer system, and computer program product for crawling and extracting main content from a web page are provided; the method may include retrieving an HTML document associated with the web page.
Patent
Web crawler recognition method and device and computer readable storage medium
Xiao Jun, Zuo Zina, Ou Huaigu, Wang Xiaoqing, Zhang Pan +4 more
- 18 Sep 2020
TL;DR: A web crawler recognition method and device, together with a computer-readable storage medium, are described for detecting web crawlers in a web-based network environment.
References
Patent
Multimedia redirection in a virtualized environment using a proxy server
Todd Giebler
- 10 Jan 2014
TL;DR: Methods and systems are described for multimedia redirection in a virtualized environment using a proxy server, where the proxy server may store scripting code that may be injected into web content retrieved from a content resource server.
33
Patent
Allocation and priority handling of uplink and downlink resources
Jan Lindskog, Andreas Andersson, Anders Ranheim, Pär Ankel +3 more
- 12 Feb 2008
TL;DR: A method and a telecommunication system for allocation and priority handling of uplink and downlink resources are presented, together with a Node-B in the system that enables the method, where the Node-B (NB) monitors the quotient (Q) between the DL data rate and the UL data rate.
30
Patent
Dynamic network content grabbing method and dynamic network content crawler system
Zhang Zhenhui
- 16 Jan 2013
TL;DR: A dynamic network content grabbing method and crawler system are proposed. The method comprises the following steps: submitting an access request for a target network and acquiring a target webpage comprising one or more dynamic contents; extracting the dynamic content in a specific area of the acquired target webpage; and judging whether each extracted dynamic content exists in the cache. If it does, the dynamic content is not processed; if not, the method advances to the next step to grab the dynamic content.
28
Patent
Method and system to select the highest speed server among web servers
Takayuki Kushida, Tatsuo Miyazawa +1 more
- 25 Sep 2001
TL;DR: A content request is issued by a client to a plurality of web servers to which the client is connected via a network, each of the web servers having the same requested content, and the web server that can provide the requested content to the client at the highest speed is selected.
22
Patent
Web crawler system with page-rendering function and implementation method thereof
Bin Huang
- 11 May 2011
TL;DR: A web crawler system with a page-rendering function is described, which renders a web page directly and then saves the rendering results in a picture format.
19