JavaScript seems to be disabled in your browser.
You must have JavaScript enabled in your browser to utilize the functionality of this website.

Describe the summary statistics of DataFrame in Pandas

Summary statistics:

import pandas as pd

df = pd.DataFrame([[10, 20, 30, 40], [7, 14, 21, 28], [55, 15, 8, 12],
                   [15, 14, 1, 8], [7, 1, 1, 8], [5, 4, 9, 2]],
                  columns=['Apple', 'Orange', 'Banana', 'Pear'],
                  index=['Basket1', 'Basket2', 'Basket3', 'Basket4',
                         'Basket5', 'Basket6'])

print("\n----------- Describe DataFrame -----------\n")
print(df.describe())

print("\n----------- Describe Column -----------\n")
print(df[['Apple']].describe())

C:\pandas>python example.py
 
----------- Describe DataFrame -----------
 
           Apple     Orange     Banana       Pear
count   6.000000   6.000000   6.000000   6.000000
mean   16.500000  11.333333  11.666667  16.333333
std    19.180719   7.257180  11.587349  14.555640
min     5.000000   1.000000   1.000000   2.000000
25%     7.000000   6.500000   2.750000   8.000000
50%     8.500000  14.000000   8.500000  10.000000
75%    13.750000  14.750000  18.000000  24.000000
max    55.000000  20.000000  30.000000  40.000000
 
----------- Describe Column -----------
 
           Apple
count   6.000000
mean   16.500000
std    19.180719
min     5.000000
25%     7.000000
50%     8.500000
75%    13.750000
max    55.000000
 
C:\pandas>

Creating a Series using List and Dictionary

Create and Print DataFrame

Set Index and Columns of DataFrame

Rename DataFrame Columns

select rows from a DataFrame using operator

Filter DataFrame rows using isin

Example of iterrows and itertuples

Drop DataFrame Column(s) by Name or Index

Add new column to DataFrame

Get list of the column headers

Generate DataFrame with random values

Select multiple columns from DataFrame

Create series using NumPy functions

Get index and values of a series

Specify an Index at Series creation

Get Length Size and Shape of a Series

Example of Heads, Tails and Takes

Slicing a Series into subsets

DataFrame slicing using loc

DataFrame slicing using iloc

loc vs iloc slicing in DataFrame

Reindex DataFrame columns

Determine DataFrame columns data type

Change DataFrame column data type from Int64 to String

Change DataFrame column data-type from UnixTime to DateTime

Alter DataFrame column data type from Float64 to Int32

Alter DataFrame column data type from Object to Datetime64

Convert Dictionary into DataFrame

Appending two DataFrame objects

Add row with specific index name

Append rows using a for loop

Add a row at top

Dynamically Add Rows to DataFrame

Insert a row at an arbitrary position

Adding row to DataFrame with time stamp index

Adding rows with different column names

Example of append, concat and combine_first

Get mean(average) of rows and columns

Calculate sum across rows and columns

Join two columns

Empty DataFrame with Date Index

Filter rows which contain specific keyword

Filtering DataFrame Index

Filtering DataFrame with an AND operator

Find all rows contain a Sub-string

Example of using any()

Example of where()

Count number of rows per group

Get Unique row values

DataFrame is empty

Count Distinct Values

Remove duplicate rows based on two columns

Remove duplicate rows

Get value of a specific cell

Get scalar value of a cell using conditional indexing

Remove duplicate rows

Get list of cell value conditionally

Replace values in column with a dictionary

Count distinct equivalent

Handle missing data

Delete missing data rows

Drop columns with missing data

Sort Index in descending order

Sort Column in descending order

Determine Rank of DataFrame values

Multiple Indexing

Specify Index and Column for DataFrame

Determine Period Index and Column for DataFrame

Determine Period Range with Frequency

Import CSV with specific Index

Writing DataFrame to CSV file

Read specific columns from CSV

Get list of CSV columns

Find row where values for column is maximum

Complex filter data using query method

Check if one or more columns all exist

Locating the n-smallest and n-largest values

Finding minimum and maximum values

Find index position of minimum and maximum values

Calculation of a cumulative product and sum

Summary statistics of DataFrame

Find Mean, Median and Mode

Measure Variance and Standard Deviation

Calculating the percent change at each cell of a DataFrame

Forward and backward filling of missing values

Calculating correlation between two DataFrame

Calculating Co-variance

Stacking using non-hierarchical indexes

Unstacking using hierarchical indexes