PyData
  NYC 2012
Python and Big Data
• Python has become an established language for
 scientific, engineering, and technical computing

• Ubiquitous in industry for all kinds of problems
 large and small
 - National labs
 - Finance
 - Oil & Gas
 - Consumer Products
 - Aerospace / Defense
Next Steps
• Challenge is now to build/expand this community around
 out-of-core and distributed data structures and
 algorithms.

• A unifying focus of the PyData community.
• The accessibility of Python syntax will empower the next
 generation of “data scientists” just as it has empowered
 this generation of “real” scientists.
A few thoughts on how
• Integration with as much as possible: Python is and
 should remain the ultimate glue (need Python .JVM
 equivalent of Python .NET)

• Compelling new features (Python compiler, out-of-
 core data structures, R-inspired data-frames with
 hierarchical indexing, meta-data enhanced plotting)

• Focus on pragmatic solving of real problems easily not
 on language fanboyism or feature-fetish. Empower
 domain experts and the occasional programmer.
Thank you!
• Our Sponsors
  -   DE Shaw and Co
  -   Appnexus
  -   JPMorgan
  -   NumFOCUS
  -   PSF

• Our Organizers:      People at Continuum Analytics, Inc., Lambda-Foundry


• Our Speakers: They put a lot of effort into both the talks and the
 work being discussed

• Attendees: your participation makes this all possible!
Reminders
• Sign up for the hack-a-thon: http://pydata.eventbrite.com/
 - Bring your photo-ID that matches your registration name
 - Need to be on the list to be admitted. List being sent over at noon
 - Sign up to lead BOFs, sprints, or demos at wiki page: http://tinyurl.com/
   pydata-sprints
• Register for Dinner tonight (included in ticket)
 - Continuum will present a brief overview of our products and services
   and answer questions but otherwise it will be lively discussion among
   friends
  -Special networking track for students and other people looking for jobs
• PyData West in Santa Clara: March 19-21, 2013
  - http://pydatawest2013.pydata.org
PyData
  NYC 2012

PyData Introduction

  • 1.
  • 2.
    Python and BigData • Python has become an established language for scientific, engineering, and technical computing • Ubiquitous in industry for all kinds of problems large and small - National labs - Finance - Oil & Gas - Consumer Products - Aerospace / Defense
  • 3.
    Next Steps • Challengeis now to build/expand this community around out-of-core and distributed data structures and algorithms. • A unifying focus of the PyData community. • The accessibility of Python syntax will empower the next generation of “data scientists” just as it has empowered this generation of “real” scientists.
  • 4.
    A few thoughtson how • Integration with as much as possible: Python is and should remain the ultimate glue (need Python .JVM equivalent of Python .NET) • Compelling new features (Python compiler, out-of- core data structures, R-inspired data-frames with hierarchical indexing, meta-data enhanced plotting) • Focus on pragmatic solving of real problems easily not on language fanboyism or feature-fetish. Empower domain experts and the occasional programmer.
  • 5.
    Thank you! • OurSponsors - DE Shaw and Co - Appnexus - JPMorgan - NumFOCUS - PSF • Our Organizers: People at Continuum Analytics, Inc., Lambda-Foundry • Our Speakers: They put a lot of effort into both the talks and the work being discussed • Attendees: your participation makes this all possible!
  • 6.
    Reminders • Sign upfor the hack-a-thon: http://pydata.eventbrite.com/ - Bring your photo-ID that matches your registration name - Need to be on the list to be admitted. List being sent over at noon - Sign up to lead BOFs, sprints, or demos at wiki page: http://tinyurl.com/ pydata-sprints • Register for Dinner tonight (included in ticket) - Continuum will present a brief overview of our products and services and answer questions but otherwise it will be lively discussion among friends -Special networking track for students and other people looking for jobs • PyData West in Santa Clara: March 19-21, 2013 - http://pydatawest2013.pydata.org
  • 7.

Editor's Notes