|
Data Integration in Proteomics
Gyorgy Babnigg
Argonne National Laboratory, Biosciences Division
Hosted by Ray Bair
10:30 AM, November 2, 2004
Building 21, Room A261
|
| Abstract |
My presentation will cover two topics: (1) issues concerning data
integration in proteomics and (2) bioinformatics in proteomics. Our
laboratory uses two-dimensional gel electrophoresis (2DE) coupled with
tandem mass spectrometry (MS-MS) to study the proteomes of a diverse set
of species, including microbes with potential use in bioremediation. This
approach produces diverse and complex data types that makes data
integration, data analysis, and interrogation challenging. Some of the Web
applications built for 2DE and MS-MS data input and data analyses will be
described. In addition, a new non-redundant protein sequence database will
be presented. This database integrates publicly available data and private
data with versioning capability. A unique identifier (SEGUID) allows
seamless integration of experimental data and sequence attributes. An
ORACLE-based cluster computing approach with the use of a local Linux
cluster (BIO-RC) and other potential use of Linux clusters on-site (Chiba
City and Jazz cluster) will be discussed.
|