The Phantom Pattern Problem

Mapping Intimacies ◽

10.1093/oso/9780198864165.001.0001 ◽

2020 ◽

Author(s):

Gary Smith ◽

Jay Cordes

Keyword(s):

Pattern Recognition ◽

Big Data ◽

Unlimited Number ◽

Pattern Problem

Pattern recognition prowess served our ancestors well. However, today we are confronted by a deluge of data that are far more abstract, complicated, and difficult to interpret than were annual seasons and the sounds of predators. The number of possible patterns that can be identified relative to the number that are genuinely useful has grown exponentially—which means that the chances that a discovered pattern is useful is rapidly approaching zero. Coincidental streaks, clusters, and correlations are the norm—not the exception. Our challenge is to overcome our inherited inclination to think that all patterns are meaningful.Computer algorithms can easily identify an essentially unlimited number of phantom patterns and relationships that vanish when confronted with fresh data. The paradox of big data is that the more data we ransack for patterns, the more likely it is that what we find will be worthless. Our challenge is to overcome our inherited inclination to think that all patterns are meaningful.

Download Full-text

Big Data Analytics and Pattern Recognition Methods in the Problem of Optimization of Technological Processes in Metallurgical Production

Journal of Physics Conference Series ◽

10.1088/1742-6596/913/1/012003 ◽

2017 ◽

Vol 913 ◽

pp. 012003

Author(s):

D Gainanov ◽

D Berenov

Keyword(s):

Pattern Recognition ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Metallurgical Production ◽

Technological Processes ◽

Pattern Recognition Methods

Download Full-text

Pattern Recognition: Evolution, Mining and Big Data

Pattern Recognition and Big Data ◽

10.1142/9789813144552_0001 ◽

2016 ◽

pp. 1-36 ◽

Cited By ~ 3

Author(s):

Amita Pal ◽

Sankar K. Pal

Keyword(s):

Pattern Recognition ◽

Big Data

Download Full-text

Artificial Neural Networks and Their Applications in Business

Encyclopedia of Information Science and Technology, Fourth Edition ◽

10.4018/978-1-5225-2255-3.ch576 ◽

2018 ◽

pp. 6642-6657

Author(s):

Trevor J. Bihl ◽

William A. Young II ◽

Gary R. Weckman

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Big Data ◽

Artificial Neural Networks ◽

Model Complexity ◽

Ann Model ◽

Processing Methods ◽

Starting Point ◽

Clustering And Classification ◽

Artificial Neural

Despite the natural advantage humans have for recognizing and interpreting patterns, large and complex datasets, as in Big Data, preclude efficient human analysis. Artificial neural networks (ANNs) provide a family of pattern recognition approaches for prediction, clustering and classification applicable to KDD with ANN model complexity ranging from simple (for small problems) highly complex (for large issues). To provide a starting point for readers, this chapter first describes foundational concepts that relate to ANNs. A listing of commonly used ANN methods, heuristics, and criteria for initializing ANNs is then discussed. Common pre- and post- data processing methods for dimensionality reduction and data quality issues are then described. The authors then provide a tutorial example of ANN analysis. Finally, the authors list and describe applications of ANNs to specific business related endeavors for further reading.

Download Full-text

Artificial Neural Networks and Their Applications in Business

Advanced Methodologies and Technologies in Artificial Intelligence, Computer Simulation, and Human-Computer Interaction - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-5225-7368-5.ch074 ◽

2019 ◽

pp. 1009-1027

Author(s):

Trevor J. Bihl ◽

William A. Young II ◽

Gary R. Weckman

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Big Data ◽

Artificial Neural Networks ◽

Model Complexity ◽

Ann Model ◽

Processing Methods ◽

Starting Point ◽

Clustering And Classification ◽

Artificial Neural

Despite the natural advantage humans have for recognizing and interpreting patterns, large and complex datasets, as in big data, preclude efficient human analysis. Artificial neural networks (ANNs) provide a family of pattern recognition approaches for prediction, clustering, and classification applicable to KDD with ANN model complexity ranging from simple (for small problems) to highly complex (for large issues). To provide a starting point for readers, this chapter first describes foundational concepts that relate to ANNs. A listing of commonly used ANN methods, heuristics, and criteria for initializing ANNs are then discussed. Common pre- and post-data processing methods for dimensionality reduction and data quality issues are then described. The authors then provide a tutorial example of ANN analysis. Finally, the authors list and describe applications of ANNs to specific business-related endeavors for further reading.

Download Full-text

The Reproducibility Crisis

The Phantom Pattern Problem ◽

10.1093/oso/9780198864165.003.0008 ◽

2020 ◽

pp. 137-152

Author(s):

Gary Smith ◽

Jay Cordes

Keyword(s):

Data Mining ◽

Big Data ◽

Statistical Evidence ◽

Unlimited Number

Attempts to replicate reported studies often fail because the research relied on data mining—searching through data for patterns without any pre-specified, coherent theories. The perils of data mining can be exacerbated by data torturing—slicing, dicing, and otherwise mangling data to create patterns. If there is no underlying reason for a pattern, it is likely to disappear when someone attempts to replicate the study. Big data and powerful computers are part of the problem, not the solution, in that they can easily identify an essentially unlimited number of phantom patterns and relationships, which vanish when confronted with fresh data. If a researcher will benefit from a claim, it is likely to be biased. If a claim sounds implausible, it is probably misleading. If the statistical evidence sounds too good to be true, it probably is.

Download Full-text

A Survey on Challenges Facing Artificial Intelligence Based Pattern Recognition for Business Oriented Big Data Analytics and Solutions

Software Engineering and Algorithms - Lecture Notes in Networks and Systems ◽

10.1007/978-3-030-77442-4_20 ◽

2021 ◽

pp. 237-248

Author(s):

Ahmed Maghawry ◽

Amr Elhadidi ◽

Ahmed Alqassed ◽

Mohamed Awad ◽

Ayman Taha ◽

...

Keyword(s):

Artificial Intelligence ◽

Pattern Recognition ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics

Download Full-text

Pattern Recognition Method of English Distance Online Education Based on Big Data Algorithm

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - e-Learning, e-Education, and Online Training ◽

10.1007/978-3-030-84383-0_24 ◽

2021 ◽

pp. 278-288

Author(s):

Xiao-xiao Duan ◽

Ping Duan

Keyword(s):

Pattern Recognition ◽

Big Data ◽

Online Education ◽

Pattern Recognition Method ◽

Recognition Method

Download Full-text

Pattern Recognition and Big Data

10.1142/10153 ◽

2016 ◽

Cited By ~ 2

Author(s):

Amita Pal ◽

Sankar K Pal

Keyword(s):

Pattern Recognition ◽

Big Data

Download Full-text

Session Introduction — Pattern Recognition in Biomedical Data: Challenges in putting big data to work

Biocomputing 2019 ◽

10.1142/9789813279827_0001 ◽

2018 ◽

Author(s):

Shefali Setia Verma ◽

Anurag Verma ◽

Dokyoon Kim ◽

Christian Darabos

Keyword(s):

Pattern Recognition ◽

Big Data ◽

Biomedical Data

Download Full-text

Big Data Analytics and Structural Health Monitoring: A Statistical Pattern Recognition-Based Approach

Sensors ◽

10.3390/s20082328 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2328 ◽

Cited By ~ 8

Author(s):

Alireza Entezami ◽

Hassan Sarmadi ◽

Behshid Behkamal ◽

Stefano Mariani

Keyword(s):

Pattern Recognition ◽

Feature Extraction ◽

Big Data ◽

Structural Health Monitoring ◽

Health Monitoring ◽

Statistical Pattern Recognition ◽

Statistical Decision ◽

Structural Health ◽

Statistical Pattern ◽

Arma Modeling

Recent advances in sensor technologies and data acquisition systems opened up the era of big data in the field of structural health monitoring (SHM). Data-driven methods based on statistical pattern recognition provide outstanding opportunities to implement a long-term SHM strategy, by exploiting measured vibration data. However, their main limitation, due to big data or high-dimensional features, is linked to the complex and time-consuming procedures for feature extraction and/or statistical decision-making. To cope with this issue, in this article we propose a strategy based on autoregressive moving average (ARMA) modeling for feature extraction, and on an innovative hybrid divergence-based method for feature classification. Data relevant to a cable-stayed bridge are accounted for to assess the effectiveness and efficiency of the proposed method. The results show that the offered hybrid divergence-based method, in conjunction with ARMA modeling, succeeds in detecting damage in cases strongly characterized by big data.

Download Full-text