Optimizing support vector machine analysis in low density biological data sets

Pablo Rivas; Sharon Moore; Urszula Iwaniec; Russell Turner; Kathy Grant; Erich Baker

doi:10.1109/CSCI46756.2018.00263

Oregon National Primate Research Center

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

We explore the effectiveness of Support Vector Machines (SVM) for classification in a sparse data set. Non-human primate models are utilized to analyze Alcohol Use Disorders (AUDs); however, the resulting data have a limited sample size. The challenges of low sample numbers and low replicates are explored using a variety of optimization strategies for feature extraction, including correlation, entropy, density, linear support vector machines for regression (SVR), backward SVR, and forward SVR. We investigate these approaches against the backdrop of the relationship between alcohol consumption and tibial bone mineral density. The results indicate that machine learning (ML) can be used effectively on small and diverse biological data sets. The best relevance feature ranking strategies are correlation, SVR forward, and SVR backward.
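The correlation-based ranking named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the correlation was computed, so Pearson correlation is an assumption here, the helper names `pearson` and `rank_by_correlation` are hypothetical, and the data are synthetic stand-ins rather than the study's measurements. The sketch scores each feature by the absolute value of its correlation with the target and sorts the features from most to least relevant.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no linear information
    return cov / math.sqrt(vx * vy)


def rank_by_correlation(rows, target):
    """Return feature indices sorted by |r| with the target, strongest first."""
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in rows]
        scores.append((abs(pearson(column, target)), j))
    # Sort by descending score; ties are broken by feature index.
    scores.sort(key=lambda s: (-s[0], s[1]))
    return [j for _, j in scores]


# Synthetic stand-in data (NOT the study's data): 12 samples, 4 features.
rng = random.Random(0)
rows = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(12)]

# Hypothetical target driven almost entirely by feature 2, plus small noise.
target = [3.0 * row[2] + 0.1 * rng.gauss(0, 1) for row in rows]

ranking = rank_by_correlation(rows, target)
print(ranking)
```

A filter of this kind scores each feature independently of any model. The forward and backward SVR strategies mentioned in the abstract would instead add or remove features one at a time based on the fit of a regression model, which this sketch does not attempt.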

Original language	English (US)
Title of host publication	Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1357-1361
Number of pages	5
ISBN (Electronic)	9781728113609
DOIs	https://doi.org/10.1109/CSCI46756.2018.00263
State	Published - Dec 2018
Event	2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018 - Las Vegas, United States Duration: Dec 13 2018 → Dec 15 2018


Keywords

Bone modeling
Machine learning
Relevance feature mapping
Support vector regression

ASJC Scopus subject areas

Computer Networks and Communications
Computer Science Applications
Hardware and Architecture
Information Systems and Management
Control and Optimization
Modeling and Simulation
Artificial Intelligence

Cite this

Rivas, P., Moore, S., Iwaniec, U., Turner, R., Grant, K., & Baker, E. (2018). Optimizing support vector machine analysis in low density biological data sets. In Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018 (pp. 1357-1361). Article 8947645 (Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CSCI46756.2018.00263

@inproceedings{918d8e3e806746d2a045bae931969b56,

title = "Optimizing support vector machine analysis in low density biological data sets",

abstract = "We explore the effectiveness of Support Vector Machines (SVM) for classification in a sparse data set. Non-human primate models are utilized to analyze Alcohol Use Disorders (AUDs); however, the resulting data have a limited sample size. The challenges of low sample numbers and low replicates are explored using a variety of optimization strategies for feature extraction, including correlation, entropy, density, linear support vector machines for regression (SVR), backward SVR, and forward SVR. We investigate these approaches against the backdrop of the relationship between alcohol consumption and tibial bone mineral density. The results indicate that machine learning (ML) can be used effectively on small and diverse biological data sets. The best relevance feature ranking strategies are correlation, SVR forward, and SVR backward.",

keywords = "Bone modeling, Machine learning, Relevance feature mapping, Support vector regression",

author = "Pablo Rivas and Sharon Moore and Urszula Iwaniec and Russell Turner and Kathy Grant and Erich Baker",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018 ; Conference date: 13-12-2018 Through 15-12-2018",

year = "2018",

month = dec,

doi = "10.1109/CSCI46756.2018.00263",

language = "English (US)",

series = "Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1357--1361",

booktitle = "Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018",

}

TY - GEN

T1 - Optimizing support vector machine analysis in low density biological data sets

AU - Rivas, Pablo

AU - Moore, Sharon

AU - Iwaniec, Urszula

AU - Turner, Russell

AU - Grant, Kathy

AU - Baker, Erich

PY - 2018/12

Y1 - 2018/12

AB - We explore the effectiveness of Support Vector Machines (SVM) for classification in a sparse data set. Non-human primate models are utilized to analyze Alcohol Use Disorders (AUDs); however, the resulting data have a limited sample size. The challenges of low sample numbers and low replicates are explored using a variety of optimization strategies for feature extraction, including correlation, entropy, density, linear support vector machines for regression (SVR), backward SVR, and forward SVR. We investigate these approaches against the backdrop of the relationship between alcohol consumption and tibial bone mineral density. The results indicate that machine learning (ML) can be used effectively on small and diverse biological data sets. The best relevance feature ranking strategies are correlation, SVR forward, and SVR backward.

KW - Bone modeling

KW - Machine learning

KW - Relevance feature mapping

KW - Support vector regression

UR - http://www.scopus.com/inward/record.url?scp=85078516754&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85078516754&partnerID=8YFLogxK

U2 - 10.1109/CSCI46756.2018.00263

DO - 10.1109/CSCI46756.2018.00263

M3 - Conference contribution

AN - SCOPUS:85078516754

T3 - Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018

SP - 1357

EP - 1361

BT - Proceedings - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2018 International Conference on Computational Science and Computational Intelligence, CSCI 2018

Y2 - 13 December 2018 through 15 December 2018

ER -
