Abstract:
A similarity search initialization system includes a leaf selector to select a leaf of a suffix tree generated from a target string representing a target sequence. The selected leaf is associated with a prefix in the suffix tree having a longest match to a suffix of a query string representing a query. The system further includes a distance module to determine a distance between the query and a subsequence of the target sequence represented by a candidate substring of the target string. The candidate substring includes the prefix associated with the selected leaf. The determined distance is to provide an initial upper bound in a similarity search of the target sequence using the query.
Abstract:
A lowest common ancestor of a first data sequence and a second data sequence is determined. Based on the lowest common ancestor, symbols that differ between the first data sequence and the second data sequence are identified. A distance between the first data sequence and the second data sequence is determined based on the symbols.
Abstract:
Probable anomalies associated with at least one data metric may be detected across a series of windows of time series data by comparison of data to a threshold. An estimated probability of anomalies for each of the windows of time series data may be determined based on the detected probable anomalies and the threshold. The windows of time series data may be ranked based on the estimated probabilities. Probable anomalies associated with highest ranked windows of time series data may be output to a user.
Abstract:
Systems and methods of anomaly detection in data centers. An example method may include analyzing time series data for the data center by testing statistical hypotheses. The method may also include constructing upper and lower bounds based on the statistical hypotheses. The method may also include flagging anomalies in the time series data falling outside of the upper and lower bounds.
Abstract:
A method and apparatus are disclosed for identifying anomalies of a signal, by analyzing a signal using a frequency-based technique, analyzing results of the frequency-based analysis using a statistical analysis technique, determining one or more limits based on the statistical analysis, and comparing a frequency domain representation of the signal to the limits to identify anomalies of the signal.
Abstract:
Provided are, among other things, systems, methods and techniques for generating a representative data string. In one representative implementation: (a) starting data positions are identified within input strings of data values; (b) a subsequence of output data values is determined based on the data values at data positions determined with reference to the starting data positions within the input strings; (c) an identification is made as to which of the input strings have segments that match the subsequence of output data values, based on a matching criterion; (d) steps (a)-(c) are repeated for a number of iterations; and (e) the subsequences of output data values are combined across the iterations to provide an output data string, with the determination in step (b) for a current iteration being based on the identification in step (c) for a previous iteration.
Abstract:
To broadcast different types of transmission having different tiers of coverage in a wireless broadcast network, each base station processes data for a wide-area transmission in accordance with a first mode (or coding and modulation scheme) to generate data symbols for the wide-area transmission and processes data for a local transmission in accordance with a second mode to generate data symbols for the local transmission. The first and second modes are selected based on the desired coverage for wide-area and local transmissions, respectively. The base station also generates pilots and overhead information for local and wide-area transmissions. The data, pilots, and overhead information for local and wide-area transmissions are multiplexed onto their transmission spans, which may be different sets of frequency subbands, different time segments, or different groups of subbands in different time segments. More than two different types of transmission may also be multiplexed and broadcast.
Abstract:
An exhaust flow system for use with a turbofan jet engine that provides separate fan flow and core flow streams that are not mixed. The system includes a fan nozzle and a primary flow nozzle. The primary flow nozzle includes a downstream edge portion that is either beveled with one or more beveled surfaces, or that contains a curving edge surface or a combination of a beveled edge and a curved edge to help direct noise generated by the jet engine upwardly away from a ground surface during take-off and landing procedures. The primary exhaust nozzle can also be orientated with an elongated lip portion thereof formed by the beveled edge surface such that the lip potion is orientated between a top dead center and a bottom dead center position, to thus help direct noise away from a cabin area of a fuselage of a mobile platform during cruise conditions.
Abstract:
Probable anomalies associated with at least one data metric may be detected across a series of windows of time series data by comparison of data to a threshold. An estimated probability of anomalies for each of the windows of time series data may be determined based on the detected probable anomalies and the threshold. The windows of time series data may be ranked based on the estimated probabilities. Probable anomalies associated with highest ranked windows of time series data may be output to a user.
Abstract:
Improving data clustering stability. A computer accesses a first plurality of cluster groups comprising data. The computer then applies a clustering method to the first plurality of cluster groups while adjusting said first plurality of cluster groups to be in higher agreement between themselves, thereby generating a second plurality of cluster groups that is in higher agreement between themselves than the first plurality of cluster groups. The second plurality of cluster groups corresponds to the first plurality of cluster groups.