Record Details
Field | Value |
---|---|
Title | A Novel Statistical Method for Identifying Monotonic Relationships in Noisy Plots |
Names | Eberhart-Garah, Robbie (creator); Field, Katharine (advisor) |
Date Issued | 2014 (iso8601) |
Note | Bachelor of Science (BS) |
Abstract | As scientific technologies and techniques have improved in past decades, it has become possible to quickly collect unprecedented quantities of experimental data. In the study of gene expression, for instance, datasets comprising tens of thousands of variables and hundreds of treatments can be produced in a matter of months. When analyzing three or four hundred treatments, manual verification becomes impractical. A useful statistical method should, therefore, be well suited to identifying subsets of data in which a relationship is expressed while ignoring non-informative data. This paper presents a novel statistical method for identifying monotonic relationships between all variable pairs in large datasets. Specifically, it is designed to perform two functions: 1) to determine the probability that a plot is random, and 2) if the plot does not appear random, to select the subplot that is most likely to contain the signal. The method presented in this paper uses the longest monotonic path of a plot (the largest set of points that can be connected in a single monotonic path) as an indicator of signal strength, and will be referred to as Longest Path Analysis (LPA) throughout this paper. |
Genre | Thesis |
Topic | gene expression |
Identifier | http://hdl.handle.net/1957/50687 |
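
The abstract defines the longest monotonic path as the largest set of points that can be connected in a single monotonic path. The sketch below illustrates one way such a path length could be computed; it is not taken from the thesis. It assumes a weakly monotonic path (ties allowed), sorts the points by x, and finds the longest non-decreasing run of y values with a standard patience-sorting approach. The function names, and the permutation test used to gauge how likely a path of that length is in a random plot, are hypothetical choices for illustration and may differ from the procedure LPA actually uses.

```python
from bisect import bisect_right
import random


def longest_nondecreasing_length(values):
    """Length of the longest non-decreasing subsequence of `values`."""
    # tails[k] holds the smallest possible last value of a
    # non-decreasing subsequence of length k + 1 seen so far.
    tails = []
    for v in values:
        i = bisect_right(tails, v)
        if i == len(tails):
            tails.append(v)
        else:
            tails[i] = v
    return len(tails)


def longest_monotonic_path_length(points):
    """Largest number of (x, y) points joinable by one weakly monotonic path.

    Assumption: "monotonic" means weakly increasing or weakly decreasing.
    """
    increasing = longest_nondecreasing_length([y for _, y in sorted(points)])
    # Negating y turns a decreasing path into an increasing one.
    flipped = sorted((x, -y) for x, y in points)
    decreasing = longest_nondecreasing_length([y for _, y in flipped])
    return max(increasing, decreasing)


def permutation_p_value(points, n_permutations=1000, seed=0):
    """Hypothetical randomness check: share of shuffled plots whose longest
    monotonic path is at least as long as the observed one."""
    rng = random.Random(seed)
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    observed = longest_monotonic_path_length(points)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(ys)  # break any x-y relationship
        if longest_monotonic_path_length(list(zip(xs, ys))) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)
```

For the five points `(1, 2), (2, 1), (3, 3), (4, 5), (5, 4)`, `longest_monotonic_path_length` returns 3, since at most three of them (for example those with y values 2, 3, 5) lie on a single increasing path. Sorting plus binary search keeps the cost at O(n log n) per plot, which matters when every variable pair in a large dataset is screened.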