๐ Introduction
When working with data, understanding statistics is essential.
With NumPy, you can easily perform statistical analysis using built-in functions.
๐ Why Use Statistical Functions?
Statistical functions help you:
- Analyze data trends
- Summarize datasets
- Prepare data for machine learning
๐งช Example Dataset
import numpy as npdata = np.array([10, 20, 30, 40, 50])
๐ 1. Mean (Average)
The mean is the average of all values.
print(np.mean(data))
๐ Output:
30.0
๐ 2. Median
The median is the middle value.
print(np.median(data))
๐ Output:
30.0
๐ 3. Standard Deviation (std)
Shows how spread out values are.
print(np.std(data))
๐ Output:
14.1421356237
๐ฆ 4. Variance
Variance is the square of standard deviation.
print(np.var(data))
๐ 5. Min & Max
print(np.min(data))
print(np.max(data))
๐ Output:
10
50
๐ 6. Axis-Based Operations (2D Arrays)
data = np.array([[1,2,3],[4,5,6]])print(np.mean(data, axis=0)) # column-wise
print(np.mean(data, axis=1)) # row-wise
โก Why NumPy is Efficient?
NumPy performs:
- Fast calculations
- Vectorized operations
- Optimized performance
๐ฆ Real-World Use Case
marks = np.array([70, 80, 90, 60])average = np.mean(marks)
print("Average Marks:", average)
โ Useful in:
- Student analysis
- Sales reports
- Data science
๐ Used with Other Libraries
Statistical functions are widely used in:
- Pandas
- Scikit-learn
- TensorFlow
๐ Summary Table
| Function | Description |
|---|---|
| mean() | Average |
| median() | Middle value |
| std() | Standard deviation |
| var() | Variance |
| min() | Minimum |
| max() | Maximum |
๐ง Pro Tips
- Use
axisfor multi-dimensional data - Combine multiple functions for insights
- Always clean data before analysis
๐ Conclusion
Statistical functions in NumPy make data analysis simple, fast, and powerful.