Par-PSSE: Software for Pairwise Statistical Significance Estimation in Parallel for Local Sequence Alignment

Yuhong Zhang; Patwary, Mostofa Ali; Misra, Sanchit; Agrawal, Ankit; Wei-keng Liao; Zhiguang Qin; Choudhary, Alok
March 2012
International Journal of Digital Content Technology & its Applic;Mar2012, Vol. 6 Issue 5, p200
Academic Journal
Pairwise statistical significance (PSS) has been recognized as a very useful method for homology detection. It can help in estimating whether the output of sequence alignment is evolutionarily link or just arisen by accident. However, pairwise statistical significance estimation (PSSE) poses a big challenge in terms of performance and scalability since it is both computationally intensive and data intensive to construct the empirical score distribution during the estimation. This paper presents a software library for estimating pairwise statistical significance in parallel, named Par-PSSE, implemented in C++ using OpenMP, MPI paradigms and their hybrids. Further, we apply the parallelization technique to estimate non-conservative PSS using standard, sequence-specific, and position-specific substitution matrices. These extensions have been found superior compared to the standard pairwise statistical significance in term of retrieval accuracy. Through distributing the compute-intensive kernels of the pairwise statistical significance estimation across multiple computational units, we achieve a speedup of up to 621.73x over the corresponding sequential implementation when using1024 cores.


Related Articles

  • A model for communication between resource discovery and load balancing units in computing environments. Arab, Mohammad; Sharifi, Mohsen // Journal of Supercomputing;Jun2014, Vol. 68 Issue 3, p1538 

    Resource overloading causes one of the main challenges in computing environments. In this case, a new resource should be discovered to transfer the extra load. However, this results in drastic performance degradation. Thus, it is of high importance to discover the appropriate resource at first....

  • The Human Element of IT. Dowse, Chris; Hertz, Paul // CIO Insight;Jun2009, Issue 105, p26 

    The article presents the study which examines the effects of human elements to the information technology (IT) management in the U.S. The study was conducted to IT executives, managers and other functionaries in the different IT shops, which points out human motivations that dominantly affect...

  • Early Estimation of Back-End Software Development Effort. Mishra, Samaresh; Pattnaik, Prasant Ku.; Mall, Rajib // International Journal of Computer Applications;Nov2011, Vol. 33, p6 

    An appropriate cost metrics for estimating development effort of the database part of an application at conceptual design stage using ER model is an important consideration. We propose four cost metrics based on the level of complexity of an ER diagram. Our effort estimation technique is based...

  • Combining spatial and temporal patches for scalable video indexing. Piro, Paolo; Anthoine, Sandrine; Debreuve, Eric; Barlaud, Michel // Multimedia Tools & Applications;May2010, Vol. 48 Issue 1, p89 

    This paper tackles the problem of scalable video indexing. We propose a new framework combining spatial and motion patch descriptors. The spatial descriptors are based on a multiscale description of the image and are called Sparse Multiscale Patches. We propose motion patch descriptors based on...

  • EMC CAPTIVA ALMOST THREE TIMES FASTER THAN KEY COMPETITOR.  // Productivity Software;Dec2010, Vol. 23 Issue 12, p4 

    The article reports that EMC Corp. in Hopkinton, Massachusetts has demonstrated the market-leading performance, scalability, manageability and modularity of its EMC Captiva intelligent enterprise capture solution. It reveals that Captiva outperformed the competition on the basis of processing...

  • Social Software in the Enterprise. Byrne, Tony // AIIM E-DOC;May/Jun2008, Vol. 22 Issue 3, p64 

    The article discusses the social and organizational aspect of social software in the enterprise. Social software offers a wide range of tools which help in giving emphasis on individual empowerment and more ad-hoc information organization and management. Enterprises are gradually adopting such...

  • Prediction Capability Analysis of Two and Three Parameters Software Reliability Growth Models. Prince Williams, D. R. // Information Technology Journal;2006, Vol. 5 Issue 6, p1048 

    This study aims to analyze the predict capability of some of the popular Software Reliability growth models to understand the potential of having two or three parameters to facilitate the estimation process. The predict validity analysis will be on two key factors, one pertaining to the degree...

  • Constructive Cost Model.  // Network Dictionary;2007, p120 

    A definition of the term "Constructive Cost Model" (COCOMO) is presented. It refers to a method of estimating the number of man-months it will take to develop a software product. COCOMO has three levels: basic COCOMO, intermediate COCOMO and detailed COCOMO.

  • SensSB: a software toolbox for the development and sensitivity analysis of systems biology models. Rodriguez-Fernandez, Maria; Banga, Julio R. // Bioinformatics;Jul2010, Vol. 26 Issue 13, p1675 

    Summary: SensSB (Sensitivity Analysis for Systems Biology) is an easy to use, MATLAB-based software toolbox, which integrates several local and global sensitivity methods that can be applied to a wide variety of biological models. In addition to addressing the sensitivity analysis problem,...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics