<p><strong>Background: Transparency, reproducibility and reusability of scientific analysis</strong></p><p>Researchers are now widely expected to share the data and source code of their work&#160;to foster transparency, reproducibility and reusability.<br>Alas, the quality of data documentation and scientific software scripts&#160;can vary substantially. In many instances, metadata and information on the provenance of data&#160;are missing or incomplete, and source code often does not include a clear list of dependencies&#160;(including version information) or systems requirements.&#160;Finally, the source code does not include sufficient inline documentation&#160;to be easily understood.&#160;As a consequence, even though the data and related scripts may be released&#160;under an open-source license, analysis too often cannot be reproduced or adapted&#160;with reasonable effort by other researchers.</p><p><strong>The pyam package</strong></p><p>This talk presents the open-source Python package <strong>pyam</strong> for energy system scenario analysis&#160;and visualization. The aim of pyam is not to provide any ground-breaking new methods&#160;or analysis routines. Instead, it provides a reliable, well-tested interface<br>similar in feel & style to the widely used <strong>pandas</strong>&#160;package,&#160;but geared for data formats and applications often used in energy systems analysis<br>and integrated assessment modelling.</p><p>By using pyam for their scenario input data processing and analysis workflows,&#160;researchers can reduce standard tasks like unit conversion and data validation&#160;from a 5-minute effort to 30 seconds - and have the knowledge that their scripts&#160;won't break if pandas or another dependency change their APIs,&#160;because the pyam community will work to ensure forward-compatibility and continuity.&#160;As another side benefit, the pyam package will raise meaningful errors when input data doesn't make sense,&#160;whereas own ad-hoc scripts may fail silently or - even worse - return non-sensical values.</p><p><strong>Spatial, temporal and sectoral aggregation & downscaling features</strong></p><p>To highlight the applicability of the pyam package for the EGU community&#160;and energy & climate modellers at large,&#160;this talk will focus on the features for spatial, temporal and sectoral aggregation&#160;and downscaling. The package include several often-used methods like weighting by&#160;proxy variables or deriving indicators based on minimum or maximum values of timeseries data.</p><p><strong>Building a community</strong></p><p>The pyam package follows best-practice of version control, continuous-integration&#160;and scientific-software documentation. This facilitates building on the package by&#160;other researchers. The community uses several tools for communication and discussion, including a Slack channel, an email list and a Github repository for issues & pull requests. And of course, we appreciate contributions by colleagues&#160;to extend the scope of features and methods based on their own use cases and requirements!</p><p><strong>More information</strong></p><p>ReadTheDocs: https://pyam-iamc.readthedocs.io/<br>GitHub repo: https://github.com/iamconsortium/pyam</p>