Evolvix BEST Names for semantic reproducibility across code2brain interfaces
Abstract
Names in programming are vital for understanding the meaning of code and big data. We define code2brain (C2B) interfaces as maps in compilers and brains between meaning and naming syntax, which help to understand executable code. While working toward an Evolvix syntax for general‐purpose programming that makes accurate modeling easy for biologists, we observed how names affect C2B quality. To protect learning and coding investments, C2B interfaces require long‐term backward compatibility and semantic reproducibility (accurate reproduction of computational meaning from coder‐brains to reader‐brains by code alone). Semantic reproducibility is often assumed until confusing synonyms degrade modeling in biology to deciphering exercises. We highlight empirical naming priorities from diverse individuals and roles of names in different modes of computing to show how naming easily becomes impossibly difficult. We present the Evolvix BEST (Brief, Explicit, Summarizing, Technical) Names concept for reducing naming priority conflicts, test it on a real challenge by naming subfolders for the Project Organization Stabilizing Tool system, and provide naming questionnaires designed to facilitate C2B debugging by improving names used as keywords in a stabilizing programming language. Our experiences inspired us to develop Evolvix using a flipped programming language design approach with some unexpected features and BEST Names atmore »
- Authors:
-
more »
- Wisconsin Institute for Discovery, Laboratory of Genetics
- Wisconsin Institute for Discovery
- Departments of Computer Sciences
- Statistics
- Wisconsin Institute for Discovery, Departments of Computer Sciences
- Wisconsin Institute for Discovery, Chemical and Biological Engineering University of Wisconsin‐Madison Madison Wisconsin
- Department of Computer Science University of Porto Porto Portugal
- Department of Biology Cedar Crest College Allentown Pennsylvania
- Department of Biology Spelman College Atlanta Georgia
- College of the Menominee Nation Keshena Wisconsin
- Publication Date:
- Research Org.:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC); National Science Foundation (NSF); National Institutes of Health (NIH); NHGRI; NIGMS; University of Wisconsin-Madison; U.S. Air Force
- OSTI Identifier:
- 1334369
- Alternate Identifier(s):
- OSTI ID: 1341331; OSTI ID: 1625920
- Grant/Contract Number:
- DE‐AC02‐06CH11357; AC02-06CH11357; T32GM007133; 5T32HG002760; 1R01GM086445‐01; FA9550‐15‐1‐0212; 1149123
- Resource Type:
- Published Article
- Journal Name:
- Annals of the New York Academy of Sciences
- Additional Journal Information:
- Journal Name: Annals of the New York Academy of Sciences Journal Volume: 1387 Journal Issue: 1; Journal ID: ISSN 0077-8923
- Publisher:
- Wiley-Blackwell
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; Science & Technology - Other Topics
Citation Formats
Loewe, Laurence, Scheuer, Katherine S., Keel, Seth A., Vyas, Vaibhav, Liblit, Ben, Hanlon, Bret, Ferris, Michael C., Yin, John, Dutra, Inês, Pietsch, Anthony, Javid, Christine G., Moog, Cecilia L., Meyer, Jocelyn, Dresel, Jerdon, McLoone, Brian, Loberger, Sonya, Movaghar, Arezoo, Gilchrist‐Scott, Morgaine, Sabri, Yazeed, Sescleifer, Dave, Pereda‐Zorrilla, Ivan, Zietlow, Andrew, Smith, Rodrigo, Pietenpol, Samantha, Goldfinger, Jacob, Atzen, Sarah L., Freiberg, Erika, Waters, Noah P., Nusbaum, Claire, Nolan, Erik, Hotz, Alyssa, Kliman, Richard M., Mentewab, Ayalew, Fregien, Nathan, and Loewe, Martha. Evolvix BEST Names for semantic reproducibility across code2brain interfaces. United States: N. p., 2016.
Web. doi:10.1111/nyas.13192.
Loewe, Laurence, Scheuer, Katherine S., Keel, Seth A., Vyas, Vaibhav, Liblit, Ben, Hanlon, Bret, Ferris, Michael C., Yin, John, Dutra, Inês, Pietsch, Anthony, Javid, Christine G., Moog, Cecilia L., Meyer, Jocelyn, Dresel, Jerdon, McLoone, Brian, Loberger, Sonya, Movaghar, Arezoo, Gilchrist‐Scott, Morgaine, Sabri, Yazeed, Sescleifer, Dave, Pereda‐Zorrilla, Ivan, Zietlow, Andrew, Smith, Rodrigo, Pietenpol, Samantha, Goldfinger, Jacob, Atzen, Sarah L., Freiberg, Erika, Waters, Noah P., Nusbaum, Claire, Nolan, Erik, Hotz, Alyssa, Kliman, Richard M., Mentewab, Ayalew, Fregien, Nathan, & Loewe, Martha. Evolvix BEST Names for semantic reproducibility across code2brain interfaces. United States. https://doi.org/10.1111/nyas.13192
Loewe, Laurence, Scheuer, Katherine S., Keel, Seth A., Vyas, Vaibhav, Liblit, Ben, Hanlon, Bret, Ferris, Michael C., Yin, John, Dutra, Inês, Pietsch, Anthony, Javid, Christine G., Moog, Cecilia L., Meyer, Jocelyn, Dresel, Jerdon, McLoone, Brian, Loberger, Sonya, Movaghar, Arezoo, Gilchrist‐Scott, Morgaine, Sabri, Yazeed, Sescleifer, Dave, Pereda‐Zorrilla, Ivan, Zietlow, Andrew, Smith, Rodrigo, Pietenpol, Samantha, Goldfinger, Jacob, Atzen, Sarah L., Freiberg, Erika, Waters, Noah P., Nusbaum, Claire, Nolan, Erik, Hotz, Alyssa, Kliman, Richard M., Mentewab, Ayalew, Fregien, Nathan, and Loewe, Martha. Mon .
"Evolvix BEST Names for semantic reproducibility across code2brain interfaces". United States. https://doi.org/10.1111/nyas.13192.
@article{osti_1334369,
title = {Evolvix BEST Names for semantic reproducibility across code2brain interfaces},
author = {Loewe, Laurence and Scheuer, Katherine S. and Keel, Seth A. and Vyas, Vaibhav and Liblit, Ben and Hanlon, Bret and Ferris, Michael C. and Yin, John and Dutra, Inês and Pietsch, Anthony and Javid, Christine G. and Moog, Cecilia L. and Meyer, Jocelyn and Dresel, Jerdon and McLoone, Brian and Loberger, Sonya and Movaghar, Arezoo and Gilchrist‐Scott, Morgaine and Sabri, Yazeed and Sescleifer, Dave and Pereda‐Zorrilla, Ivan and Zietlow, Andrew and Smith, Rodrigo and Pietenpol, Samantha and Goldfinger, Jacob and Atzen, Sarah L. and Freiberg, Erika and Waters, Noah P. and Nusbaum, Claire and Nolan, Erik and Hotz, Alyssa and Kliman, Richard M. and Mentewab, Ayalew and Fregien, Nathan and Loewe, Martha},
abstractNote = {Names in programming are vital for understanding the meaning of code and big data. We define code2brain (C2B) interfaces as maps in compilers and brains between meaning and naming syntax, which help to understand executable code. While working toward an Evolvix syntax for general‐purpose programming that makes accurate modeling easy for biologists, we observed how names affect C2B quality. To protect learning and coding investments, C2B interfaces require long‐term backward compatibility and semantic reproducibility (accurate reproduction of computational meaning from coder‐brains to reader‐brains by code alone). Semantic reproducibility is often assumed until confusing synonyms degrade modeling in biology to deciphering exercises. We highlight empirical naming priorities from diverse individuals and roles of names in different modes of computing to show how naming easily becomes impossibly difficult. We present the Evolvix BEST (Brief, Explicit, Summarizing, Technical) Names concept for reducing naming priority conflicts, test it on a real challenge by naming subfolders for the Project Organization Stabilizing Tool system, and provide naming questionnaires designed to facilitate C2B debugging by improving names used as keywords in a stabilizing programming language. Our experiences inspired us to develop Evolvix using a flipped programming language design approach with some unexpected features and BEST Names at its core.},
doi = {10.1111/nyas.13192},
journal = {Annals of the New York Academy of Sciences},
number = 1,
volume = 1387,
place = {United States},
year = {Mon Dec 05 00:00:00 EST 2016},
month = {Mon Dec 05 00:00:00 EST 2016}
}
https://doi.org/10.1111/nyas.13192
Works referenced in this record:
Reproducing Statistical Results
journal, April 2015
- Stodden, Victoria
- Annual Review of Statistics and Its Application, Vol. 2, Issue 1
The case for open computer programs
journal, February 2012
- Ince, Darrel C.; Hatton, Leslie; Graham-Cumming, John
- Nature, Vol. 482, Issue 7386
α-Amylase Inhibitors: A Review of Raw Material and Isolated Compounds from Plant Source
journal, February 2012
- Sales, Paloma Michelle; Souza, Paula Monteiro; Simeoni, Luiz Alberto
- Journal of Pharmacy & Pharmaceutical Sciences, Vol. 15, Issue 1
Towards a whole-cell modeling approach for synthetic biology
journal, June 2013
- Purcell, Oliver; Jain, Bonny; Karr, Jonathan R.
- Chaos: An Interdisciplinary Journal of Nonlinear Science, Vol. 23, Issue 2
Quantifying the threat of extinction from Muller's ratchet in the diploid Amazon molly (Poecilia formosa)
journal, January 2008
- Loewe, Laurence; Lamatsch, Dunja K.
- BMC Evolutionary Biology, Vol. 8, Issue 1
The Economics of Reproducibility in Preclinical Research
journal, June 2015
- Freedman, Leonard P.; Cockburn, Iain M.; Simcoe, Timothy S.
- PLOS Biology, Vol. 13, Issue 6
Evolution@home: observations on participant choice, work unit variation and low-effort global computing
journal, January 2007
- Loewe, Laurence
- Software: Practice and Experience, Vol. 37, Issue 12
Parallel Reproducible Summation
journal, July 2015
- Demmel, James; Nguyen, Hong Diep
- IEEE Transactions on Computers, Vol. 64, Issue 7
A C library for retrieving specific reactions from the BioModels database
journal, September 2013
- Neal, M. L.; Galdzicki, M.; Gallimore, J. T.
- Bioinformatics, Vol. 30, Issue 1
Cytoscape 2.8: new features for data integration and network visualization
journal, December 2010
- Smoot, M. E.; Ono, K.; Ruscheinski, J.
- Bioinformatics, Vol. 27, Issue 3
Amylase—Its Clinical Significance: a Review of the Literature
journal, January 1976
- Salt, William B.; Schenker, Steven
- Medicine, Vol. 55, Issue 4
Summary of the DREAM8 Parameter Estimation Challenge: Toward Parameter Identification for Whole-Cell Models
journal, May 2015
- Karr, Jonathan R.; Williams, Alex H.; Zucker, Jeremy D.
- PLOS Computational Biology, Vol. 11, Issue 5
Quantifying the genomic decay paradox due to Muller's ratchet in human mitochondrial DNA
journal, April 2006
- Loewe, Laurence
- Genetical Research, Vol. 87, Issue 2
Is salivary alpha-amylase an indicator of autonomic nervous system dysregulations in mental disorders?—A review of preliminary findings and the interactions with cortisol
journal, June 2013
- Schumacher, Sarah; Kirschbaum, Clemens; Fydrich, Thomas
- Psychoneuroendocrinology, Vol. 38, Issue 6
Partitioning the UMLS semantic network
journal, June 2002
- Zong Chen, ; Perl, Y.; Halper, M.
- IEEE Transactions on Information Technology in Biomedicine, Vol. 6, Issue 2
Reproducibility: changing the policies and culture of cell line authentication
journal, May 2015
- Freedman, Leonard P.; Gibson, Mark C.; Ethier, Stephen P.
- Nature Methods, Vol. 12, Issue 6
Semantic reclassification of the UMLS concepts
journal, July 2008
- Fan, J. -W.; Friedman, C.
- Bioinformatics, Vol. 24, Issue 17
A proprietary alpha-amylase inhibitor from white bean (Phaseolus vulgaris): A review of clinical studies on weight loss and glycemic control
journal, March 2011
- Barrett, Marilyn L.; Udani, Jay K.
- Nutrition Journal, Vol. 10, Issue 1
Annotation and merging of SBML models with semanticSBML
journal, November 2009
- Krause, F.; Uhlendorf, J.; Lubitz, T.
- Bioinformatics, Vol. 26, Issue 3
WholeCellSimDB: a hybrid relational/HDF database for whole-cell model predictions
journal, January 2014
- Karr, J. R.; Phillips, N. C.; Covert, M. W.
- Database, Vol. 2014, Issue 0
A data integration methodology for systems biology
journal, November 2005
- Hwang, D.; Rust, A. G.; Ramsey, S.
- Proceedings of the National Academy of Sciences, Vol. 102, Issue 48
On the potential for extinction by Muller's Ratchet in Caenorhabditis elegans
journal, January 2008
- Loewe, Laurence; Cutter, Asher D.
- BMC Evolutionary Biology, Vol. 8, Issue 1
Overcoming an obstacle in expanding a UMLS semantic type extent
journal, February 2012
- Chen, Yan; Gu, Huanying; Perl, Yehoshua
- Journal of Biomedical Informatics, Vol. 45, Issue 1
A Whole-Cell Computational Model Predicts Phenotype from Genotype
journal, July 2012
- Karr, Jonathan R.; Sanghvi, Jayodita C.; Macklin, Derek N.
- Cell, Vol. 150, Issue 2
Analysis of the semantic structures of medical reference languages: Part 2. Analysis of the semantic power of MeSH, ICD and SNOMED
journal, January 1978
- Major, Pauline; Kostrewski, B. J.; Anderson, J.
- Medical Informatics, Vol. 3, Issue 4
A Reappraisal of How to Build Modular, Reusable Models of Biological Systems
journal, October 2014
- Neal, Maxwell L.; Cooling, Michael T.; Smith, Lucian P.
- PLoS Computational Biology, Vol. 10, Issue 10
WholeCellKB: model organism databases for comprehensive whole-cell models
journal, November 2012
- Karr, Jonathan R.; Sanghvi, Jayodita C.; Macklin, Derek N.
- Nucleic Acids Research, Vol. 41, Issue D1
Lazy Updating of hubs can enable more realistic models by speeding up stochastic simulations
journal, November 2014
- Ehlert, Kurt; Loewe, Laurence
- The Journal of Chemical Physics, Vol. 141, Issue 20
Introduction
book, January 2012
- Doan, AnHai; Halevy, Alon; Ives, Zachary
- Principles of Data Integration
A Review on Structure–Activity Relationship of Dietary Polyphenols Inhibiting α-Amylase
journal, January 2013
- Xiao, Jianbo; Ni, Xiaoling; Kai, Guoyin
- Critical Reviews in Food Science and Nutrition, Vol. 53, Issue 5
Systems in Evolutionary Systems Biology
book, January 2016
- Loewe, L.
- Encyclopedia of Evolutionary Biology
Exercise upregulates salivary amylase in humans (Review)
journal, January 2014
- Koibuchi, Eri; Suzuki, Yoshio
- Experimental and Therapeutic Medicine, Vol. 7, Issue 4
NixOS: A purely functional Linux distribution
journal, October 2010
- Dolstra, Eelco; LÖH, Andres; Pierron, Nicolas
- Journal of Functional Programming, Vol. 20, Issue 5-6
Reproducible Research in Computational Harmonic Analysis
journal, January 2009
- Donoho, David L.; Maleki, Arian; Rahman, Inam Ur
- Computing in Science & Engineering, Vol. 11, Issue 1
Corleone: hands-off crowdsourcing for entity matching
conference, June 2014
- Gokhale, Chaitanya; Das, Sanjib; Doan, AnHai
- SIGMOD/PODS'14: International Conference on Management of Data, Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
The principles of whole-cell modeling
journal, October 2015
- Karr, Jonathan R.; Takahashi, Koichi; Funahashi, Akira
- Current Opinion in Microbiology, Vol. 27
A lexical metaschema for the UMLS semantic network
journal, January 2005
- Zhang, Li; Perl, Yehoshua; Halper, Michael
- Artificial Intelligence in Medicine, Vol. 33, Issue 1
Nomenclature- and Database-Compatible Names for the Two Ebola Virus Variants that Emerged in Guinea and the Democratic Republic of the Congo in 2014
journal, November 2014
- Kuhn, Jens; Andersen, Kristian; Baize, Sylvain
- Viruses, Vol. 6, Issue 11
WholeCellViz: data visualization for whole-cell models
journal, January 2013
- Lee, Ruby; Karr, Jonathan R.; Covert, Markus W.
- BMC Bioinformatics, Vol. 14, Issue 1
Application of microbial α-amylase in industry - A review
journal, January 2010
- Souza, Paula Monteiro de; Magalhães, Pérola de Oliveira e.
- Brazilian Journal of Microbiology, Vol. 41, Issue 4
Programming Languages shouldn't be "too Natural"
journal, February 2015
- Mandrioli, Dino; Pradella, Matteo
- ACM SIGSOFT Software Engineering Notes, Vol. 40, Issue 1
Titanium: a high-performance Java dialect
journal, September 1998
- Yelick, Kathy; Semenzato, Luigi; Pike, Geoff
- Concurrency: Practice and Experience, Vol. 10, Issue 11-13
Concepts and Synonymy in the UMLS Metathesaurus
journal, October 2009
- Merrill, Gary
- Journal of Biomedical Discovery and Collaboration, Vol. 4
Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences
journal, January 2010
- Goecks, Jeremy; Nekrutenko, Anton; Taylor, James
- Genome Biology, Vol. 11, Issue 8
Accelerated discovery via a whole-cell model
journal, November 2013
- Sanghvi, Jayodita C.; Regot, Sergi; Carrasco, Silvia
- Nature Methods, Vol. 10, Issue 12