Using training set selection methods to improve text mining on market prediction via news headlines

Author(s):  
Farzad Niknam ◽  
Aliakbar Niknafs
Author(s):  
Andrew F. Zahrt ◽  
Brennan T. Rose ◽  
William T. Darrow ◽  
Jeremy J. Henle ◽  
Scott E. Denmark

Different subset selection methods are examined to guide catalyst selection in optimization campaigns. Error assessment methods are used to quantitatively inform selection of new catalyst candidates from in silico libraries of catalyst structures.


1995 ◽  
Vol 3 (4) ◽  
pp. 279-292 ◽  
Author(s):  
I. T. Cousins ◽  
M. T. D. Cronin ◽  
J. C. Dearden ◽  
C. D. Watts

2015 ◽  
Vol 42 (1) ◽  
pp. 306-324 ◽  
Author(s):  
Arman Khadjeh Nassirtoussi ◽  
Saeed Aghabozorgi ◽  
Teh Ying Wah ◽  
David Chek Ling Ngo

Author(s):  
Tomasz Kajdanowicz ◽  
Slawomir Plamowski ◽  
Przemyslaw Kazienko

Choosing a proper training set for machine learning tasks is of great importance in complex domain problems. In the paper a new distance measure for training set selection is presented and thoroughly discussed. The distance between two datasets is computed using variance of entropy in groups obtained after clustering. The approach is validated using real domain datasets from debt portfolio valuation process. Eventually, prediction performance is examined.


Sign in / Sign up

Export Citation Format

Share Document