Parallel suffix array construction algorithms for pc-cluster

dc.contributor.authorJun Lee, Kok
dc.date.accessioned2015-07-30T05:28:14Z
dc.date.available2015-07-30T05:28:14Z
dc.date.issued2004
dc.description.abstractSuffix array (lvlanber & Myers. 1993) is the most compact and simple jidl rexr indices. It is widely used for fast and detail queries over large databases. The only draw back of sutlix array is the time consumed during its construction phase. Due to this problem, many specific algorithms have been designed to overcome the computation bottleneck. Another option to solve this problem effectively is to utilize parallel processing. The intent of this thesis is to design parallel suffix array construction algorithms for low cost network PC-cluster. Two parallel algorithms are proposed and the performance investigated. Firstly, the effort focus on proposing an algorithm that always performs weii for practical input string that is independent of bandwidth issues. A parallel suffix sorting algorithm called Steady algorithm that involved no communication is proposed. In the implementation of steady algorithm, some unrelated communication is involved. This unrelated communication is created by message passing library for tasks spawning and user I/0 transfer which are not the part of the algorithm. To the best of our knowledge, this algorithm is the first suffix array constntction algorithm that involves no communication. The experimental result we obtained shows the Steady algorithm performs well under low bandwidth PC-cluster. Our second attempt in this thesis is to improve the sorting time in the Steady algorithm. Another parallel suffix sorting algorithm called Parallel Two-stage algorithm is proposed. We adopt a specific suffix sorting technique of Itoh & Tanaka (1999) Two-Stage algorithm. We experimentally show that this algorithm improved the timing of Steady algorithm under a limited number of processors. Besides, we show that Parallel Two-Stage algorithm obtained a satisfactory speed up using a limited number of processors, which is not achieved by the specific algorithm of Kitajima & Navarro (1999).en_US
dc.identifier.urihttp://hdl.handle.net/123456789/1051
dc.language.isoenen_US
dc.subjectArray constructionen_US
dc.subjectAlgorithmsen_US
dc.titleParallel suffix array construction algorithms for pc-clusteren_US
dc.typeThesisen_US
Files
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: