scholarly journals DNA Sequence Chromatogram Browsing Using JAVA and CORBA

1999 ◽  
Vol 9 (3) ◽  
pp. 277-281 ◽  
Author(s):  
Jeremy D. Parsons ◽  
Eugen Buehler ◽  
LaDeana Hillier

DNA sequence chromatograms (traces) are the primary data source for all large-scale genomic and expressed sequence tags (ESTs) sequencing projects. Access to the sequencing trace assists many later analyses, for example contig assembly and polymorphism detection, but obtaining and using traces is problematic. Traces are not collected and published centrally, they are much larger than the base calls derived from them, and viewing them requires the interactivity of a local graphical client with local data. To provide efficient global access to DNA traces, we developed a client/server system based on flexible Java components integrated into other applications including an applet for use in a WWW browser and a stand-alone trace viewer. Client/server interaction is facilitated by CORBA middleware which provides a well-defined interface, a naming service, and location independence.[The software is packaged as a Jar file available from the following URL: http://www.ebi.ac.uk/∼jparsons. Links to working examples of the trace viewers can be found athttp://corba.ebi.ac.uk/EST. All the Washington University mouse EST traces are available for browsing at the same URL.]

2020 ◽  
Author(s):  
Zhong Chen ◽  
Yuhang Xu ◽  
Zhiguo Xie

The informal judgments of the well-formedness of phrases and sentences have long been used as the primary data source for syntacticians. In recent years, the reliability of data based on linguists’ introspective intuitions is increasingly subject to scrutiny. Although a number of studies were able to replicate a vast majority of English judgments published in a textbook and in peer-reviewed journal articles, the status of data in many non-English languages has yet to be experimentally examined. In this work, we em- ployed formal quantitative methods to evaluate the reliability of judgments in the widely used textbook, The Syntax of Chinese (Huang, Li, & Li, 2009). We first assessed example sentences based on the acceptability ratings from 148 native Mandarin Chinese speakers. Using a target forced-choice task, we further explored the potentially problematic sentence pairs. Results of the two experiments suggest an eminently successful replication of judgments in the book: out of the 557 data samples tested, only five sentence pairs require further investigation. This large-scale study represents the first attempt to replicate the judgments in a non-English syntax textbook, in hopes to bridge the gap between the informal data-collection in Chinese linguistic research and the protocols of experimental cognitive science.


2019 ◽  
Author(s):  
Matthias Becker ◽  
Milind Chabbi ◽  
Stefanie Warnat-Herresthal ◽  
Kathrin Klee ◽  
Jonas Schulte-Schrepping ◽  
...  

Next generation sequencing (NGS) is the driving force behind precision medicine and is revolutionizing most, if not all, areas of the life sciences. Particularly when targeting the major common diseases, an exponential growth of NGS data is foreseen for the next decades. This enormous increase of NGS data and the need to process the data quickly for real-world applications requires to rethink our current compute infrastructures. Here we provide evidence that memory-driven computing (MDC), a novel memory-centric hardware architecture, is an attractive alternative to current processor-centric compute infrastructures. To illustrate how MDC can change NGS data handling, we used RNA-seq assembly and pseudoalignment followed by quantification as two first examples. Adapting transcriptome assembly pipelines for MDC reduced compute time by 5.9-fold for the first step (SAMtools). Even more impressive, pseudoalignment by near-optimal probabilistic RNA-seq quantification (kallisto) was accelerated by more than two orders of magnitude with identical accuracy and indicated 66% reduced energy consumption. One billion RNA-seq reads were processed in just 92 seconds. Clearly, MDC simultaneously reduces data processing time and energy consumption. Together with the MDC-inherent solutions for local data privacy, a new compute model can be projected pushing large scale NGS data processing and primary data analytics closer to the edge by directly combining high-end sequencers with local MDC, thereby also reducing movement of large raw data to central cloud storage. We further envision that other data-rich areas will similarly benefit from this new memory-centric compute architecture.


2019 ◽  
Vol 12 (2) ◽  
pp. 79-100
Author(s):  
Nur Hidayati

Sayyid Abdullah bin Alwi Al-Haddad is a famous figure of Sufism. One of the books is the Minutes of Al-Mu'awanah, this study aims to find out how moral education according to Sayyid Abdullah Bin Alwi Al-Haddad in the Book of Al-Mu'awanah. The questions to be answered through this research are: (1) How is moral education according to Sayyid Abdullah bin Alwi Al-Haddad in the Book of Minutes of Al-Mu'awanah (2) What are the implications of the moral education of the Book of Al-Muawanah treatise according to Sayyid Abdullah bin Alwi Al- Haddad in everyday life. The research method used is library research. The data obtained is sourced from the literature. The primary data source is the Al-Mu'awanah Risalah, the secondary source is the translation and the other sources are the books and other books that are relevant and relevant to the research. The technical data analysis (content analysis) uses the Deductive method, the Inductive method. The findings of this study, show that the values ​​of moral education contained in the book of the Book of Al-Mu'awanah by Sayyid Abdullah bin Alwi Al-Haddad is very relevant to education now, and is needed to change students who are currently still madhmumah. ), be a person of morality (good). The model of moral education in the Book of Al-Mu'awanah is arguably very practical and still holds fast to the Qur'an and the Hadith. The thoughts of Sayyid Abdullah bin Alwi Al-Haddad about the moral education contained in the Book of Al-Mu'awanah can be grouped into three large-scale writers. First: Morals to Allah SWT. Second: Moral towards yourself. Third: Moral towards the environment


2020 ◽  
Vol 10 (16) ◽  
pp. 5452
Author(s):  
Khireddine Benaissa ◽  
Salim Bitam ◽  
Abdelhamid Mellouk

Basic Safety Messages that are frequently generated from multiple connected vehicles can play a primordial role in providing transport data see credible and reliable information they contain. Otherwise, when considering the way Basic Safety Messages (BSMs) are treated, multiple deficiencies prevent the latter to be capable of constituting a precious data source. As we know, data become more useful the more widely are used, which is the exact opposite of what happens with the BSMs that exist only temporarily, used locally, considered disposable, and are never stored. In this paper, we introduce a data reuse model that retains collected BSMs, stores, and processes them inside the vehicle constituting a continuous data source holding retained snapshots along the roadway. Our model provided a primary data source available on a large scale, considered to be a worthy dataset for machine learning tasks, capable of visualizing different traffic-related indicators to enhance analytics and support decisions-making. In the study case, we set up an in-vehicle data platform, where we achieved an 80% of BSMs size reduction and provided a rich set of APIs to serve applications. We also adopted the Artificial Neural Networks (ANN) as an information processing paradigm for performing traffic volume prediction, where the obtained results have reached over 99% of accuracy.


2018 ◽  
Vol 10 (2) ◽  
pp. 269-295
Author(s):  
Sri Waluyo

This paper discusses the content of Q.S. al-Baqarah ([2]: 67-73). The data used in the preparation of this paper is the data that is primary and secondary. The primary source is data obtained from the core source. In conducting a study of a verse, it is clear that the primary data source is derived from the Qur'an,precisely on Q.S. al-Baqarah ([2]: 67-73). Secondary data is dataobtained from other sources that are still related to the problemand provide interpretation of the primary source. The method usedin analyzing this paper is the tahlili method. This method describesthe meaning contained by the Qur'an, verse by verse, and letterafter letter according to the order in the Mushaf. The descriptionincludes the various aspects which the interpreted verses contain,such as the meaning of the vocabulary, the connotation of thesentence, the background of the verse down, its relation to otherverses, both before and after. And do not miss the opinion that hasbeen given regarding the interpretation of these verses, whetherdelivered by the Prophet, companions, the tabi'in, as well as othercommentators. This study shows that in Q.S. (2): 67-73) there arevalues of moral education which include: 1) morals in asking, (2)morals to parents, (3) patience of educators, (4) educator honesty,and (5) obedience of learners.


2020 ◽  
Vol 2 (1) ◽  
pp. 75-87
Author(s):  
Syarifah Nuriah ◽  
Abdul Rakhman Laba ◽  
Muhammad Sobarsyah

This study aims to determine the management and control system of trade receivables on the effectiveness of the cash flow company’s at PT. Enseval Putera Megatrading, Tbk. The data source used in this study is the primary data source, obtained directly from the company. The analytical method used for testing the management and control system of receivables on the effectiveness of cash flow is the analysis of financial ratios. In this study, the data used for analysis are qualitative data analysis and financial ratio analysis, namely the activity ratio (RTO, ACP, Arrears Ratio, and Billing Ratio). The results showed that (1) RTO of PT. Enseval Putera Megatrading, the highest rate in 2017 was 189 times, while the lowest RTO was in 2015 which was 115 times. This shows the normal level of turnover. The faster the payment terms, the better for the company, because the faster the working capital embedded in receivables returns to capital or cash, which means the higher the receivables turnover. (2) ACP or the average age of collection of receivables applied by companies, especially the value in 2017 is 2 days. This means that the company has been effective in managing its accounts receivable because the standard for collecting receivables set by the company is the repayment limit or due date no later than 7 (seven) to 90 (Ninety) calendar days from the billing statement received by the service user. (3) The Arrears Ratio, namely from 2014-2018, the largest was only 1.11%. This shows that the lower the arrears ratio, the better for the company, which means the company is able to handle its receivables properly. (4) Billing Ratio shows that from 2014-2018 the lowest that is 98.88% shows the greater the value of collectible receivables means the greater the percentage value of the collection ratio so that the better for the company because of the greater return on corporate capital. Then it can be concluded that the Billing Ratio of PT. Enseval Putera Megatrading, it's not working effectively.


2018 ◽  
Vol 2 (2) ◽  
pp. 151-162
Author(s):  
Atiya Mahmud Hana

  This study aims to observe and describe the use of speech acts by Barack Obama when he announced the death of Osama bin Laden. The writer focuses on illocutionary acts used by Barack Obama. The primary data source is the transcript of Barack Obama’s speech at White House on May 1st, 2011 after the death of Osama bin Laden. The types of illocutionary acts are observed by the writer according to Searle’s Taxonomy of Illocutionary Act. They are representatives, directives, commissives, expressives, and declarations. The result of the study shows that representatives are frequetnly used by Obama in his speech. Representatives are used in 54 utterances (74%); Commissives are used in 11 utterances (15%); Expressives are used in 7 utterances (11%). Barack Obama used none both directive speech acts and declaration speech acts. Representatives are frequently used in Barack Obama speech because the purpose of the speech is to announce the death of Osama bin Laden in Pakistan. The evidence is that most utterances in the transcipt use statements, descriptions, and reports.   


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Deborah O. Dele-Oni ◽  
Karen E. Christianson ◽  
Shawn B. Egri ◽  
Alvaro Sebastian Vaca Jacome ◽  
Katherine C. DeRuff ◽  
...  

AbstractWhile gene expression profiling has traditionally been the method of choice for large-scale perturbational profiling studies, proteomics has emerged as an effective tool in this context for directly monitoring cellular responses to perturbations. We previously reported a pilot library containing 3400 profiles of multiple perturbations across diverse cellular backgrounds in the reduced-representation phosphoproteome (P100) and chromatin space (Global Chromatin Profiling, GCP). Here, we expand our original dataset to include profiles from a new set of cardiotoxic compounds and from astrocytes, an additional neural cell model, totaling 5300 proteomic signatures. We describe filtering criteria and quality control metrics used to assess and validate the technical quality and reproducibility of our data. To demonstrate the power of the library, we present two case studies where data is queried using the concept of “connectivity” to obtain biological insight. All data presented in this study have been deposited to the ProteomeXchange Consortium with identifiers PXD017458 (P100) and PXD017459 (GCP) and can be queried at https://clue.io/proteomics.


2021 ◽  
Author(s):  
Jeffrey A. Boatman ◽  
David M. Vock ◽  
Joseph S. Koopmeiners

Sign in / Sign up

Export Citation Format

Share Document