homeabout uscontact us
Bioinformatics
Software Consulting
Partners
Support
Press
Site Map
 

GeneLinker Feature List

GeneLinker Gold 4.6

NEW! Protein Biomarker Package integrated as part of all GeneLinker installations (previously the Protein Biomarker Package had to be installed as a separate package).

Data Import

  • Flexible, extensible and intuitive data import supporting averaging of within-chip replicates and Affymetrix P-values
  • Support for all types of text files, 2 color data and a variety of native chip formats
  • Currently there are 20 supported formats including: Affymetrix 4.0 and 5.0, GenePix, Genomic Solutions, CodeLink XML, Quantarray and ScanArray.
  • Binary Excel-format (.xls) tabular data.
  • Quantarray merge replicates and Quantarray Unicode support.
  • NEW! Improved Proteomics Import Scripts for use with the new NIH/NCI Center for Cancer Research Ovarian dataset 8-7-02 (see Protein Biomarker Discovery with GeneLinker Platinum for more information).
  • NEW! “Import P-Value Spectrum” script creates a dataset from the p-values and false-discovery rate. This allows exported p-values from ANOVA experiments to be re-imported, enabling novel visualization of channels that best distinguish between different classes (see Protein Biomarker Discovery with GeneLinker Platinum for more information).
  • NEW! Import scripts to handle Koadarray ratio and two-color data (including confidence values)
  • Additional formats can be added at any time

Pre-processing

  • Value removal by reliability measures and using simple expressions
  • Missing Value Estimation by central tendency, arbitrary value or nearest neighbors
  • Create variables easily within the application. Random variables can be created to assist with statistical analysis
  • Sample Filtering (based on variables)

Normalization Methods

  • Log (base 2, e and 10), Divide by Maximum, Min-Max Normalization, Standardization
  • Central tendency and Linear Regression
  • Lowess for 2-color data
  • Using Positive and Negative Control Genes

Filtering Methods

  • Gene lists
  • Maximum culling and Range culling
  • N-fold with N
  • N-fold with specified number of genes
  • Spotted array n-fold culling

Statistics

  • Summary statistics (histograms)
  • F-Test ANOVA for normally distributed data
  • Kruskal-Wallis non-parametric ANOVA
  • False Discovery Rate calculations
  • Bonferroni corrected p-values
  • Replicate merging based on an imported variable (allows multiple categorizations of samples)

Analysis

  • Partitional clustering of genes and/or samples using Self-Organizing Maps (SOMs), K-Means and Mutual Nearest Neighbors (Jarvis-Patrick) methods
  • Hierarchical clustering of genes and/or samples using single, average, or complete linkage
  • Principal Component Analysis (PCA) with 8 different visualizations
  • A variety of distance metrics including Euclidean, Euclidean Squared, Manhattan, Pearson Correlation, Pearson Squared and Spearman Rank Correlation
  • Profile matching using selected gene, averages of genes, or custom genes

Visualization

  • Color matrix plot (array intensity) visualization and dendrogram with selectable nodes, user-definable data range and color schemes.
  • Two-way cluster plots enabling simultaneous viewing of gene and sample clustering using partitional and hierarchical cluster experiments (or both)
  • Scatter plots, centroid plots, cluster plots
  • PNG, PDF and SVG graphics export
  • Linked visualizations to highlight selected items across plots
  • Coloring by variables and gene lists
  • Default visualizations for different experiment types

Database

  • User-definable external links to data sources such as GenBank, Unigene, and NetAffx
  • MySQL data repository with no size limits
  • Includes configuration applications for using IBM DB2 or Oracle 9i
  • Support for multiple repositories to make it easier to share copies of GeneLinker among researchers or manage multiple large projects

Scripting

  • Powerful scripting and meta-scripting capabilities that allow easy repetition and sharing of workflows

Other

  • Gene lists for support of pathways, functional classifications and ontologies
  • Variables for replicate identification, or identification of sample classes
  • Accurate progress meters, experiment canceling
  • Navigator pane that provides a hierarchical history of all analysis procedures and captures all parameter settings
  • HTML Experiment and Workflow reports.
  • Gene, sample and experiment annotations
  • Find functionality for genes allowing fast selection of desired gene
  • Ability to export data sets and PCA loadings
  • Export to Spotfire's DecisionSite

GeneLinker Platinum 4.6

GeneLinker Platinum includes all of the features of GeneLinker Gold listed above plus the following advanced analysis methods:
  • Support for external scripting (XScripts) to allow users to integrate their own proprietary analysis techniques into GeneLinker.
  • IBIS(tm) (Integrated Bayesian Inference System) is a powerful tool for identifying individual genes and pairs of genes that differ significantly between classes, taking into account the actual distribution of values within your data. It can identify non-linear and combinatorial patterns of gene expression that characterize different toxicity responses, disease states, or treatment outcomes. Furthermore, it can be used to build classifiers that can identify these patterns in new samples.
  • SLAM(tm) (Sub-Linear Association Mining) is a patented algorithm licensed by Predictive Patterns that is used to find correlations between discretized variables or to predict the outcome of a categorical variable. As an aid to supervised learning, SLAM? is used to find associations in gene expression data so that a list of interesting genes (features) can be created.
  • Committee of Neural Networks allow you to build classifiers that can predict toxicity responses, disease states, or treatment outcomes. The committee architecture is much more robust than a single Neural Network. You also have control over the following parameters:
    • The number of learners in the committee
    • The learner votes required for classification
    • The number of hidden units
    • The type of Conjugate Gradient used
    • Whether to use the Pocket Algorithm or not
    • The Neural Network Stopping Criteria
  • Committee of Support Vector Machines, which complements our Committee of Neural Networks with a powerful non-linear classification algorithm which is particularly well-suited to datasets with relatively small sample sizes. You have control over the following parameters:
    • The number of learners in the committee
    • The learner votes required for classification
    • The kernel type (linear, polynomial or radial basis function)
    • The kernel function parameters (degree, gamma, coefficient)
    • The cost

These features are most valuable to users with some knowledge of machine learning but they are accessible to anyone. For example, the Committee of Neural Networks sets sensible defaults for your data, and often yields good performance without adjusting these default values.

Minimum System Requirements
Windows NT 4.0 Service Pack 6a, 2000, XP, 95, 98 or ME (XP or 2000 strongly recommended)
Pentium II 400 MHz or equivalent, Pentium III 1 GHz recommended
256 MB RAM (512 MB or greater recommended)
200 MB of free hard disk space for installation
250 MB of free hard disk space for data

If you have any other questions, please contact:

Sales: (613) 539-1126
sales@improvedoutcomes.com
main menu
  Improved Outcomes Software
188 Greenlees Drive :: Kingston, ON Canada :: K7K 6P6
General Enquiries: 613-583-0555
Bioinformatics/GeneLinker: 613-539-1126
Image Guided Surgery: 613-583-0555
Fax: 613-271-8867
Email: info@improvedoutcomes.com
Copyright © 2004 Improved Outcomes Software
 
| Web Site Designed by WebWoods. |