Record Details

A Novel Statistical Method for Identifying Monotonic Relationships in Noisy Plots

ScholarsArchive at Oregon State University

Field Value
Title A Novel Statistical Method for Identifying Monotonic Relationships in Noisy Plots
Names Eberhart-Garah, Robbie (creator)
Field, Katharine (advisor)
Date Issued 2014 (iso8601)
Note Bachelor of Science (BS)
Abstract As scientific technologies and techniques have improved in past decades, it has become
possible to quickly collect unprecedented quantities of experimental data. In the study of gene expression, for instance, datasets comprising tens of thousands of variables and hundreds of treatments can be produced in a matter of months. When analyzing three or four hundred treatments, manual verification becomes implausible.A useful statistical method should, therefore, be well suited to identify subsets of data in which a relationship is expressed while ignoring non-informative data. This paper presents a novel statistical method for identifying monotonic relationships between all variable pairs in large data sets. Specifically, it is designed to perform two functions: 1) to determine the probability that a plot is random, and 2) if the plot does not appear random, to select the subplot which is most likely to contain the signal. The method presented in this paper uses the longest monotonic path of a plot (the largest set of points which can be connected in a single monotonic path) as an indicator of signal strength, and will appropriately be referred to as Longest Path Analysis (LPA) throughout this paper.
Genre Thesis
Topic gene expression
Identifier http://hdl.handle.net/1957/50687

© Western Waters Digital Library - GWLA member projects - Designed by the J. Willard Marriott Library - Hosted by Oregon State University Libraries and Press