[Nov 23, 2009 14:09] Web access to Microsoft Live@edu accounts now works.
Table of Contents
GEOSS Components and Description
GEOSS has a relational database, a web interface, and an analysis suite. There is also a repository for each user's files, and a comprehensive security system. The gene expression workflow begins with a researcher describing an experimental protocol in a minimal fashion. When samples are ready, the researcher creates an order for the microarray research center which at the University of Virginia (UVa) is the Biomedical Research Facility (BRF). The BRF hybridizes the samples as specified by the researcher, and imports the resulting data into GeneX. At this stage the data is available to the researcher. Data can be exported and/or analyzed. Our Analysis Tree package allows users to build a flow chart (yes, there is a graphical flow chart on the screen) of routines to analyze data and produce reports. Except for system administrators, all interaction with the system is via secure web pages. No special software is required for end users. All data is warehoused on powerful, secure servers.
Data is loaded by curators (personnel in the microarray center) via a web interface. Data is visible only to the researcher who owns it, unless the researcher explicitly allows other permissions. The security system allows all users to create groups, to control group membership, and to enable/disable group read and/or write permissions. Permissions apply separately to studies/experimental conditions, orders, data, and derived data (files).
Our Analysis Tree allows the user to build a graphical representation of the flow of data through various modules of an analysis. This system will be documented elsewhere.
The servers are running Fedora Linux. GEOSS is written in Perl. Most of the analysis routines are written in R, some in C, and some in Perl. We use PostgreSQL as our relational database. Its abilities to do transactions, and its high availability were critical to the project. Users must login to the system. All the web pages are accessed via SSL so that all data traveling between the user's web browser and the server is secure from eavesdropping.
Several Perl modules that aren't standard in Perl 5.8 are required. R is required, and of course Apache. All of the Perl used by GEOSS amounts to only 11,000 lines of code. GEOSS is compact. Installation has been vastly simplified and should only take a few hours.
While GEOSS is theoretically portable to Windows NT, 2000, or XP, there are a few aspects that would be nontrivial.