Record Details

Comparison of machine learning methods for predicting bird distributions

ScholarsArchive at Oregon State University

Field Value
Title Comparison of machine learning methods for predicting bird distributions
Names Bikkina, Vinay (creator); Dietterich, Thomas G. (advisor); Hutchinson, Rebecca A. (advisor)
Date Issued 2014-06-09 (iso8601)
Note Honors Bachelor of Science (HBS)
Abstract The purpose of this study is to explore kernel machine learning methods for species distribution modeling. Previous studies have shown the success of Generalized Boosted Regression Models (GBM); however, kernel methods have not been explored for species distribution modeling. Using the eBird dataset, four machine learning methods were tested for accuracy and speed. Accuracy was measured as the area under the Receiver Operating Characteristic curve (AUC). The eBird dataset was divided into training, validation, and testing sets to ensure a fair comparison between the methods, with cross-validation for the tuning parameters. The four methods tested were Generalized Linear Models (GLM), Generalized Boosted Regression Models (GBM), Kernel Support Vector Machines (KSVM), and Kernel Logistic Regression (KLR). The results show that GBM performs better than both the kernel methods and the baseline GLM. GBM is not the fastest method, but speed is less important because prediction for species distribution modeling is not time-sensitive. Therefore, GBM was found to be the best of the four methods for species distribution modeling.
Genre Thesis
Topic Machine learning
Identifier http://hdl.handle.net/1957/48849
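
Illustration (not part of the thesis record): the abstract describes dividing the data into training, validation, and testing sets and scoring each method by AUC. The sketch below shows one way those two steps could look in plain Python. The function names, the split fractions, and the toy labels and scores are assumptions made for this example; they are not taken from the thesis, and the thesis may have used different tooling.

```python
import random


def split_three_way(records, frac_train, frac_val, seed=0):
    """Shuffle records and cut them into (train, validation, test) lists.

    The fractions and the seed are illustrative assumptions only.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * frac_train)
    n_val = int(len(shuffled) * frac_val)
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


def roc_auc(labels, scores):
    """AUC computed as the fraction of (positive, negative) pairs in which
    the positive example receives the higher score; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy presence/absence labels and model scores (made up for illustration).
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)
```

In a comparison like the one the abstract describes, each model would be fit on the training portion, tuned against the validation portion, and scored once with roc_auc on the held-out test portion, so that every method is judged on the same unseen data.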
