Invention Grant
- Patent Title: Web crawler scheduler that utilizes sitemaps from websites
- Patent Title (中): Web爬网程序调度程序利用网站的站点地图
-
Application No.: US13271160Application Date: 2011-10-11
-
Publication No.: US08417686B2Publication Date: 2013-04-09
- Inventor: Sascha B. Brawer , Maximilian Ibel , Ralph Michael Keller , Narayanan Shivakumar
- Applicant: Sascha B. Brawer , Maximilian Ibel , Ralph Michael Keller , Narayanan Shivakumar
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
Public/Granted literature
- US20120036118A1 Web Crawler Scheduler that Utilizes Sitemaps from Websites Public/Granted day:2012-02-09
Information query