Abstract:
Different ad selection techniques may be evaluated and compared by (i) combining ads generated using at least two different techniques and (ii) determining performance statistics of these combined (e.g., interleaved) advertising serves over time. The relative performance of the different techniques can then be determined. These principles can also be used to gauge different ad scoring techniques. These principles can also be used to gauge different ad rendering techniques.
Abstract:
Methods and systems allow users to enter natural language terms that describe a particular web site into an address field of a browser instead of a formal URL. The terms are evaluated to determine whether they correspond, with a high likelihood, to a particular web site. If so, this web site may be immediately accessed. If not, a list of search results based on the terms may be displayed by the browser.
Abstract:
A system and method for providing a user interface with search query broadening is described. A query defining a search executable on potentially retrievable information is accepted. The query is parsed. A broadened query is provided. At least one of the broadened query and results of a search executed on the broadened query is presented.
Abstract:
Systems and techniques relating to ranking search results of a search query include, in general, subject matter that can be embodied in a computer-implemented method that includes determining a measure of relevance for a document result within a context of a search query for which the document result is returned, the determining being based on a first number in relation to a second number, the first number corresponding to longer views of the document result, and the second number corresponding to at least shorter views of the document result; and outputting the measure of relevance to a ranking engine for ranking of search results, including the document result, for a new search corresponding to the search query. The subject matter described in this specification can also be embodied in various corresponding computer program products, apparatus and systems.
Abstract:
A system may track statistics for a number of features using an approximate counting technique by: subjecting each feature to multiple, different hash functions to generate multiple, different hash values, where each of the hash values may identify a particular location in a memory, and storing statistics for each feature at the particular locations identified by the hash values. The system may generate rules for a model based on the tracked statistics.
Abstract:
Systems and methods that improve search rankings for a search query by using data associated with queries related to the search query are described. In one aspect, a search query is received, a population associated with the search query is determined, an article (such as a webpage) associated with the search query is determined, and a ranking score for the article based at least in part on data associated with the population is determined. Algorithms and types of data associated with a population useful in carrying out such systems and methods are described.
Abstract:
A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
Abstract:
A method may include receiving a document and an initial score for the document; determining that there has been a decrease in a rate or quantity of new links that point to the document over time; classifying the document as stale in response to the determining; decreasing the initial score for the document, resulting in an updated score; and ranking the document with regard to at least one other document based, at least in part, on the score.
Abstract:
A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
Abstract:
A system receives a search query from a user and searches a group of repositories, based on the search query, to identify, for each of the repositories, a set of search results. The system also identifies one of the repositories based on a likelihood that the user desires information from the identified repository and presents the set of search results associated with the identified repository.