Construction-of-CSA-Compressed-Suffix-Array-

A repository about construction of compressed suffix array (bio-info technology)

Usage manual: modify main.c to change input file path, output file path, output file header.

details:

char* FILEPATH = "testdata/testdata_1000.fna";   // file path
int ARRAYLENGTH = 0; // length of T ~ n
int PARTLENGTH = 0; // part length of T ~ l
int PARTNUM = 0; // number of parts ~ ceil(n/l)

char* T = NULL; // DNA sequence of (A,C,G,T) plus a '$'
int* SA = NULL; // SA of T
int* SA_inverse = NULL; // inverse of SA
int* Psi = NULL; // Psi of T - the compressed suffix array

char* BWT = NULL; // BWT of T - Burrows-Wheeler Transform

char* BWTFILEPATH = "outputdata/testdata_1000.bwt";
char* BWTFILEHEADER = ">gi|110640213|ref|NC_008253.1| Escherichia coli 536, bwt array";
int LINELENGTH = 70;

FILEPATH ~ input file path

BWTFILEPATH ~ output file path

BWTFILEHEADER ~ output file header

Afterthoughts

the importance of this algorithm is not about the time complexity. The point is this algorithm reduces the peak memory usage from O(nlog n) to O(n), which will be O(nlog n) if the CSA is built directly.
...

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
bin/Debug		bin/Debug
obj/Debug		obj/Debug
outputdata		outputdata
testdata		testdata
BasicStep.h		BasicStep.h
Bio-info Tech Experiment 1.cbp		Bio-info Tech Experiment 1.cbp
Bio-info Tech Experiment 1.depend		Bio-info Tech Experiment 1.depend
Bio-info Tech Experiment 1.layout		Bio-info Tech Experiment 1.layout
FileOperation.h		FileOperation.h
HelperFunction.h		HelperFunction.h
MergeStep.h		MergeStep.h
README.md		README.md
SABuildFunc.h		SABuildFunc.h
SimpleTest.h		SimpleTest.h
headfiletest.h		headfiletest.h
main.c		main.c
package-info.info		package-info.info

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Construction-of-CSA-Compressed-Suffix-Array-

About

Releases

Packages

Contributors 2

Languages

AtoshDustosh/Construction-of-CSA-Compressed-Suffix-Array

Folders and files

Latest commit

History

Repository files navigation

Construction-of-CSA-Compressed-Suffix-Array-

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages