Computer Science Department Colloquium
San Jose State University

Active Learning Support Vector Machines to Classify Imbalanced Reservoir Simulation Data

Tina Yu
Associate Professor,
Computer Science, Memorial University

Oct. 31, 2013, 3pm.
MacQuarrie Hall 225
(Refreshments, MH229 at 2pm)


Reservoir modeling is an on-going activity during the production life of a reservoir. One challenge to constructing accurate reservoir models is the time required to carry out a large number of computer simulations. To address this issue, we have constructed surrogate models (proxies) for the computer simulator to reduce the simulation time. The quality of the proxies, however, relies on the quality of the computer simulation data. Frequently, the majority of the simulation outputs match poorly to the production data collected from the field. In other words, most of the data describe the characteristics of what the reservoir is not (negative samples), rather than what the reservoir is (positive samples). Applying machine learning methods to train a simulator proxy based on these data faces the challenge of imbalanced training data. This work applies active learning support vector machines to incrementally select a subset of informative simulation data to train a classifier as the simulator proxy. We compare the results with the results produced by the standard support vector machines combined with other imbalanced training data handling techniques. Based on the support vectors in the trained classifiers, we analyze high impact parameters that separating good-matching reservoir models from bad-matching models.

About the Speaker

Tina Yu is an Associate Professor in the Department of Computer Science at Memorial University of Newfoundland, Canada. Her current research interests are in quantum information processing and in applying machine learning and evolutionary algorithms to create intelligent systems for education, finance and energy applications. Tina is on the editorial board of the Evolutionary Computation Journal and the Genetic Programming and Evolvable Machines.