Bird Conservation Status and Meaningful Socioeconomic Correlates in Central America: Results from an Open Access Data-Mining Approach for Parrots Using Machine Learning Indicate Serious Economic Problems

2015 ◽  
pp. 77-104 ◽  
Author(s):  
Cynthia Resendiz-Infante ◽  
Falk Huettmann
2018 ◽  
Vol 4 (4) ◽  
pp. 433-470 ◽  
Author(s):  
Falk Huettmann ◽  
Stefanie M. Ickert-Bond

With the advent of global online data sharing initiatives, few limits remain to using the treasure troves of museum data for biodiversity and conservation. The University of Alaska Museum Herbarium is fully online with metadata. Over 260 000 specimens representing the largest collection of Alaska plants anywhere can be data mined. We found that most specimens were collected through the National Park Service’s Inventory and Monitoring program at Denali National Park and Preserve. The majority of specimens were collected along roads, trails, coastline, or waterways, while high-altitude, remote, and pristine sampling locations are underrepresented still. Actual field efforts varied over the years, peaking in the late 1980s. From 1 to 400 specimens were collected per sampling location, and on average 40 species were obtained per collection event at a unique location. Our analysis presents a first data mining inventory of such open access data allowing for a rapid assessment, quality control, and predictive modeling involving automated high-performing machine learning algorithms and mapping analysis using open geographic information systems concepts. Our research sets a first template for more investigations in the Arctic and we briefly compare with selected specimen details from adjacent landscapes such as the Russian Far East, Canada, and the Circumpolar North.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Satoko Hiura ◽  
Shige Koseki ◽  
Kento Koyama

AbstractIn predictive microbiology, statistical models are employed to predict bacterial population behavior in food using environmental factors such as temperature, pH, and water activity. As the amount and complexity of data increase, handling all data with high-dimensional variables becomes a difficult task. We propose a data mining approach to predict bacterial behavior using a database of microbial responses to food environments. Listeria monocytogenes, which is one of pathogens, population growth and inactivation data under 1,007 environmental conditions, including five food categories (beef, culture medium, pork, seafood, and vegetables) and temperatures ranging from 0 to 25 °C, were obtained from the ComBase database (www.combase.cc). We used eXtreme gradient boosting tree, a machine learning algorithm, to predict bacterial population behavior from eight explanatory variables: ‘time’, ‘temperature’, ‘pH’, ‘water activity’, ‘initial cell counts’, ‘whether the viable count is initial cell number’, and two types of categories regarding food. The root mean square error of the observed and predicted values was approximately 1.0 log CFU regardless of food category, and this suggests the possibility of predicting viable bacterial counts in various foods. The data mining approach examined here will enable the prediction of bacterial population behavior in food by identifying hidden patterns within a large amount of data.


Sign in / Sign up

Export Citation Format

Share Document