Tensor relational algebra for distributed machine learning system design

We consider the question: what is the abstraction that should be implemented by the computational engine of a machine learning system? Current machine learning systems typically push whole tensors through a series of compute kernels such as matrix multiplications or activation functions, where each kernel runs on an AI accelerator (ASIC) such as a GPU. This implementation abstraction provides little built-in support for ML systems to scale past a single machine, or for handling large models with matrices or tensors that do not easily fit into the RAM of an ASIC. In this paper, we present an alternative implementation abstraction called the tensor relational algebra (TRA). The TRA is a set-based algebra based on the relational algebra. Expressions in the TRA operate over binary tensor relations, where keys are multi-dimensional arrays and values are tensors. The TRA is easily executed with high efficiency in a parallel or distributed environment, and amenable to automatic optimization. Our empirical study shows that the optimized TRA-based back-end can significantly outperform alternatives for running ML workflows in distributed clusters.

Download Full-text

The Tensor-Relational Algebra, and Other Ideas in Machine Learning System Design

33rd International Conference on Scientific and Statistical Database Management ◽

10.1145/3468791.3472262 ◽

2021 ◽

Author(s):

Chris Jermaine

Keyword(s):

Machine Learning ◽

System Design ◽

Learning System ◽

Relational Algebra

Download Full-text

Insider Collusion Attack on Distributed Machine Learning System and its Solutions - A Case of SVM

Proceedings of the 7th ACM Workshop on ASIA Public-Key Cryptography ◽

10.1145/3384940.3390638 ◽

2020 ◽

Author(s):

Peter Shaojui Wang

Keyword(s):

Machine Learning ◽

Learning System ◽

Collusion Attack ◽

Distributed Machine Learning

Download Full-text

An Empirical Study of Refactorings and Technical Debt in Machine Learning Systems

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE) ◽

10.1109/icse43902.2021.00033 ◽

2021 ◽

Author(s):

Yiming Tang ◽

Raffi Khatchadourian ◽

Mehdi Bagherzadeh ◽

Rhia Singh ◽

Ajani Stewart ◽

...

Keyword(s):

Machine Learning ◽

Empirical Study ◽

Learning Systems ◽

Technical Debt

Download Full-text

An Empirical Study of Bugs in Machine Learning Systems

2012 IEEE 23rd International Symposium on Software Reliability Engineering ◽

10.1109/issre.2012.22 ◽

2012 ◽

Cited By ~ 30

Author(s):

Ferdian Thung ◽

Shaowei Wang ◽

David Lo ◽

Lingxiao Jiang

Keyword(s):

Machine Learning ◽

Empirical Study ◽

Learning Systems

Download Full-text

ArchNet: A data hiding design for distributed machine learning systems

Journal of Systems Architecture ◽

10.1016/j.sysarc.2020.101912 ◽

2020 ◽

pp. 101912

Author(s):

Kaiyan Chang ◽

Wei Jiang ◽

Jinyu Zhan ◽

Zicheng Gong ◽

Weijia Pan

Keyword(s):

Machine Learning ◽

Data Hiding ◽

Learning Systems ◽

Distributed Machine Learning

Download Full-text

Machine learning for human learners: opportunities, issues, tensions and threats

Educational Technology Research and Development ◽

10.1007/s11423-020-09858-2 ◽

2020 ◽

Author(s):

Mary E. Webb ◽

Andrew Fluck ◽

Johannes Magenheim ◽

Joyce Malyn-Smith ◽

Juliet Waters ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Ethical Issues ◽

Practical Experience ◽

Learning Systems ◽

Learning System ◽

Adaptive Behaviour ◽

Recent Developments ◽

Key Aspects ◽

School Curricula

AbstractMachine learning systems are infiltrating our lives and are beginning to become important in our education systems. This article, developed from a synthesis and analysis of previous research, examines the implications of recent developments in machine learning for human learners and learning. In this article we first compare deep learning in computers and humans to examine their similarities and differences. Deep learning is identified as a sub-set of machine learning, which is itself a component of artificial intelligence. Deep learning often depends on backwards propagation in weighted neural networks, so is non-deterministic—the system adapts and changes through practical experience or training. This adaptive behaviour predicates the need for explainability and accountability in such systems. Accountability is the reverse of explainability. Explainability flows through the system from inputs to output (decision) whereas accountability flows backwards, from a decision to the person taking responsibility for it. Both explainability and accountability should be incorporated in machine learning system design from the outset to meet social, ethical and legislative requirements. For students to be able to understand the nature of the systems that may be supporting their own learning as well as to act as responsible citizens in contemplating the ethical issues that machine learning raises, they need to understand key aspects of machine learning systems and have opportunities to adapt and create such systems. Therefore, some changes are needed to school curricula. The article concludes with recommendations about machine learning for teachers, students, policymakers, developers and researchers.

Download Full-text

Accelerating training of DNN in distributed machine learning system with shared memory

2017 International Conference on Information and Communication Technology Convergence (ICTC) ◽

10.1109/ictc.2017.8190900 ◽

2017 ◽

Author(s):

Eun-Ji Lim ◽

Shin-Young Ahn ◽

Wan Choi

Keyword(s):

Machine Learning ◽

Shared Memory ◽

Learning System ◽

Distributed Machine Learning

Download Full-text

Blockchain for federated learning toward secure distributed machine learning systems: a systemic survey

Soft Computing ◽

10.1007/s00500-021-06496-5 ◽

2021 ◽

Author(s):

Dun Li ◽

Dezhi Han ◽

Tien-Hsiung Weng ◽

Zibin Zheng ◽

Hongzhi Li ◽

...

Keyword(s):

Machine Learning ◽

Learning Systems ◽

Distributed Machine Learning

Download Full-text

Intelligent Big Data Analytics in Health

10.4018/978-1-6684-3662-2.ch081 ◽

2022 ◽

pp. 1663-1702

Author(s):

Ebru Aydindag Bayrak ◽

Pinar Kirci

Keyword(s):

Machine Learning ◽

Big Data ◽

Early Diagnosis ◽

Neurological Disorders ◽

Data Analytics ◽

Big Data Analytics ◽

Healthcare Systems ◽

Learning Systems ◽

Learning System ◽

Definition Of

Intelligent big data analytics and machine learning systems have been introduced to explain for the early diagnosis of neurological disorders. A number of scholarly researches about intelligent big data analytics in healthcare and machine learning system used in the healthcare system have been mentioned. The authors have explained the definition of big data, big data samples, and big data analytics. But the main goal is helping researchers or specialists in providing opinion about diagnosing or predicting neurological disorders using intelligent big data analytics and machine learning. Therefore, they focused on the healthcare systems using these innovative ways in particular. The information of platform and tools about big data analytics in healthcare is investigated. Numerous academic studies based on the detection of neurological disorders using both machine learning methods and big data analytics have been reviewed.

Download Full-text

A Systematic Literature Review on Federated Machine Learning

ACM Computing Surveys ◽

10.1145/3450288 ◽

2021 ◽

Vol 54 (5) ◽

pp. 1-39

Author(s):

Sin Kit Lo ◽

Qinghua Lu ◽

Chen Wang ◽

Hye-Young Paik ◽

Liming Zhu

Keyword(s):

Machine Learning ◽

Literature Review ◽

Systematic Literature Review ◽

State Of The Art ◽

System Development ◽

Learning Systems ◽

Learning System ◽

Requirement Analysis ◽

Future Trends ◽

Data Synthesis

Federated learning is an emerging machine learning paradigm where clients train models locally and formulate a global model based on the local model updates. To identify the state-of-the-art in federated learning and explore how to develop federated learning systems, we perform a systematic literature review from a software engineering perspective, based on 231 primary studies. Our data synthesis covers the lifecycle of federated learning system development that includes background understanding, requirement analysis, architecture design, implementation, and evaluation. We highlight and summarise the findings from the results and identify future trends to encourage researchers to advance their current work.

Download Full-text