Record Details
Field | Value |
---|---|
Title | Comparison of machine learning methods for predicting bird distributions |
Names |
Bikkina, Vinay
(creator) Dietterich, Thomas G. (advisor) Hutchinson, Rebecca A. (advisor) |
Date Issued | 2014-06-09 (iso8601) |
Note | Honors Bachelor of Science (HBS) |
Abstract | The purpose of this study is to explore kernel machine learning methods for species distribution modeling. Previous studies have shown the success of Generalized Boosted Regression Models, however kernel methods have been unexplored for species distribution modeling. Using the eBird dataset, four machine learning methods were tested for accuracy and speed. Accuracy was measured in terms of the Area Under the Curve of the Receiver Operating Characteristic curve. The eBird dataset was divided into training, validation, and testing sets to ensure a fair comparison between the methods, including cross validation for the tuning parameters. The four methods tested were: Generalized Linear Models (GLM), Generalized Boosted Regression Models (GBM), Kernel Support Vector Machines (KSVM), and Kernel Logistic Regression (KLR). The results show that GBM performs better than the kernel methods and the baseline GLM. GBM is not the fastest method, but this is less important since the prediction for species distribution modeling is not a time sensitive matter. Therefore, GBM was found to be the best out of the four methods for species distribution modeling. |
Genre | Thesis |
Topic | Machine learning |
Identifier | http://hdl.handle.net/1957/48849 |