2. Pandas
2
o Pyton Data analysis library
o Built on top of Numpy
o Abbreviation of Panel Data System
o Used in production in many companies
3. T h e I d e a l t o o l f o r
d a t a S c i e n t i s t s
3
oManaging data
oCleaning data
oAnalyzing
oModeling data
oOrganizing the data in a form suitable for plotting or tabular
display
9. oPython DataFrame is a data structure containing and
ordered collecetions of columns.
oEach column may hold numeric, string, boolean etc.
Values
oDataFrame has both row and column index
D a t a F r a m e
9
10. oA pandas DataFrame can be created using various inputs
like
--Lists
--Dict
--Series
--Numpy ndarrays
--Another DataFrame
C r e a t i n g a D a t a F r a m e
10
26. P y t h o n P a n d a s
I n p u t / O u t p u t T O O L S
oThe Pandas I/O API is a set of top level reader functions accessed like
pd.read_csv() that generally return a Pandas object.
oThe two functions for reading text files are read_csv() and
read_table(). They both intelligently convert tabular data into a
DataFrame object
26