Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Xray: extended arrays for scientific datasets by Stephan Hoyer PyData SV 2014

893 views

Published on

xray is a new Python package for labeled array data. It aims to provide a data analysis toolkit as efficient and powerful as pandas but designed for homogeneous N-dimensional arrays instead of tabular data. Indeed, many of its internals are built on pandas (most notably, fast indexing), and its interface mirrors pandas for features such as label-based indexing, data alignment and group-by operations. xray implements two data-structures that are missing in pandas: the DataArray, an extended array object with labeled coordinates and dimensions, and the Dataset, a dictionary-like container for manipulating a collection of DataArrays aligned along shared dimensions. The labeled dimensions of the DataArray allow for array alignment (e.g., broadcasting) and operations (e.g., sum) based on dimension names instead of array shapes and axis numbers. The data model is based on Unidata’s Common Data Model for self-describing scientific datasets, which is widely used in the geosciences.

Published in: Technology
  • Be the first to comment

  • Be the first to like this

Xray: extended arrays for scientific datasets by Stephan Hoyer PyData SV 2014

  1. 1. Introducing xray: extended arrays for scientific datasets By Stephan Hoyer http://nbviewer.ipython.org/gist/shoyer/497789ebabb3a82411b2

×