Status : Verified
| Personal Name | Morado, El King, D. |
|---|---|
| Resource Title | Biitroot: A K-mer-Optimized Single Sequence and Batch Compression and Encryption Tool |
| Date Issued | 12 January 2026 |
| Abstract | The DNA sequence serves as the fundamental blueprint of life, offering potential for biological discovery through computational analysis. With the continuous decline in sequencing costs, genomic data are being generated at an unprecedented rate—attracting not only researchers but also potential threats from malicious actors. Sensitive genetic information can be exploited in various ways, such as discrimination based on disease predisposition or blackmail involving participation in genomic studies. These concerns highlight the urgent need for secure and efficient DNA data protection. While existing approaches utilize both general-purpose and DNA-specific encryption methods, encryption alone can impose substantial computational overhead, thereby underscoring the necessity of integrating compression techniques to enhance efficiency. This study introduces Biitroot, a novel approach that combines k-mer–based compression with Advanced Encryption Standard (AES) encryption. Compression efficiency is optimized by tuning k-mer lengths, encoding schemes, and key-handling mechanisms across organisms and sequence sizes. Biitroot achieves file size reductions of up to 75%, outperforming the 67% reduction achieved by the reference method, Cryfa. Furthermore, Biitroot supports batch compression with independent per-sample processing, allowing de-compression of individual files without accessing others. Additional capabilities include customizable AES modes, support for the extended DNA alphabet, and secure management of both compression and encryption keys. |
| Degree Course | MS Computer Science |
| Language | English |
| Keyword | DNA compression, genomic security, AES encryption, k-mer optimization, bioinformatics |
| Material Type | Thesis/Dissertation |
Preliminary Pages
5.21 Mb
Category : F - Regular work, i.e., it has no patentable invention or creation, the author does not wish for personal publication, there is no confidential information.
Access Permission : Open Access
