Invention Grant
- Patent Title: Compression of genomic data file
- Patent Title (Chinese): Compression of genomic data file
- Application No.: US13428794
- Application Date: 2012-03-23
- Publication No.: US08972201B2
- Publication Date: 2015-03-03
- Inventor: Sharmila Shekhar Mande, Monzoorul Hague Mohammed, Anirban Dutta, Tungadri Bose
- Applicant: Sharmila Shekhar Mande, Monzoorul Hague Mohammed, Anirban Dutta, Tungadri Bose
- Applicant Address: IN Mumbai
- Assignee: Tata Consultancy Services Limited
- Current Assignee: Tata Consultancy Services Limited
- Current Assignee Address: IN Mumbai
- Agency: Barnes & Thornburg LLP
- Priority: IN3655/MUM/2011 20111224
- Main IPC: G06F19/22
- IPC: G06F19/22

Abstract:
Systems and methods for compression of a genomic data file are described herein. In one embodiment, genomic sequences, sequence headers, and quality sequences associated with a plurality of data streams provided in a genomic data file are identified. Each of the genomic sequences includes at least one of primary characters and secondary characters. Further, the secondary characters may be removed from each of the genomic sequences to obtain an intermediate genomic sequence file, and the quality score corresponding to each secondary character may be modified in the quality sequences to obtain an intermediate quality sequence file. Based on the intermediate genomic sequence file and the intermediate quality sequence file, a modified genomic sequence file and a modified quality sequence file, respectively, are generated. A compressed genomic data file is obtained using at least the modified genomic sequence and the modified quality sequence.
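
The separation step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions made only for illustration: the primary characters are taken to be A, C, G and T, any other character (for example N) is treated as secondary, a reserved quality symbol `~` marks the positions where a secondary character was removed, and the generic `zlib` compressor stands in for whatever encoding the patent actually applies to the modified files. The function names are hypothetical.

```python
import zlib

PRIMARY = set("ACGT")  # assumption: the four unambiguous bases are "primary"
MARKER = "~"           # assumption: quality symbol reserved to flag removed positions


def split_record(sequence, quality):
    """Drop secondary characters from the sequence and flag their quality slots."""
    if len(sequence) != len(quality):
        raise ValueError("sequence and quality must have equal length")
    kept_bases = []
    new_quality = []
    for base, score in zip(sequence, quality):
        if base in PRIMARY:
            kept_bases.append(base)
            new_quality.append(score)
        else:
            # The original score at this position is discarded in this sketch.
            new_quality.append(MARKER)
    return "".join(kept_bases), "".join(new_quality)


def restore_sequence(intermediate_sequence, intermediate_quality, secondary="N"):
    """Reinsert a placeholder secondary character wherever the marker appears."""
    bases = iter(intermediate_sequence)
    return "".join(
        secondary if q == MARKER else next(bases) for q in intermediate_quality
    )


def compress_streams(records):
    """Compress headers, sequences and qualities as three separate streams.

    records: iterable of (header, sequence, quality) tuples.
    """
    headers, seqs, quals = [], [], []
    for header, sequence, quality in records:
        seq, qual = split_record(sequence, quality)
        headers.append(header)
        seqs.append(seq)
        quals.append(qual)
    return tuple(
        zlib.compress("\n".join(stream).encode("ascii"))
        for stream in (headers, seqs, quals)
    )
```

Under these assumptions, the positions of the removed characters remain recoverable from the quality stream alone, which leaves the sequence stream with a four-letter alphabet. The sketch is lossy in two respects: the quality score at each flagged position is dropped, and every secondary character is restored as the same placeholder. It also presumes that `~` never occurs as a genuine quality score in the input.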
Public/Granted literature:
- US20130166518A1 Compression Of Genomic Data File (Public/Granted day: 2013-06-27)