Dataset info
Number of variables | 7 |
---|---|
Number of observations | 150 |
Missing cells | 0 (0.0%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 8.3 KiB |
Average record size in memory | 56.9 B |
Variables types
Numeric | 4 |
---|---|
Categorical | 2 |
Boolean | 0 |
Date | 0 |
URL | 0 |
Text (Unique) | 0 |
Rejected | 1 |
Unsupported | 0 |
Warnings
petal_width is highly correlated with petal_length (ρ = 0.9627570971) | Rejected |
petal_length
Numeric
Distinct count | 43 |
---|---|
Unique (%) | 28.7% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 3.758666667 |
---|---|
Minimum | 1 |
Maximum | 6.9 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.3 |
Q1 | 1.6 |
Median | 4.35 |
Q3 | 5.1 |
95-th percentile | 6.1 |
Maximum | 6.9 |
Range | 5.9 |
Interquartile range | 3.5 |
Descriptive statistics
Standard deviation | 1.76442042 |
---|---|
Coef of variation | 0.4694272135 |
Kurtosis | -1.401920801 |
Mean | 3.758666667 |
MAD | 1.56192 |
Skewness | -0.2744642525 |
Sum | 563.8 |
Variance | 3.113179418 |
Memory size | 1.3 KiB |
Histogram with fixed size bins (bins=43)
Histogram with variable size bins (bins=[1. 1.25 1.65 3.85 5.85 6.9 ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
1.5 | 14 | 9.3% | |
1.4 | 12 | 8.0% | |
5.1 | 8 | 5.3% | |
4.5 | 8 | 5.3% | |
1.3 | 7 | 4.7% | |
1.6 | 7 | 4.7% | |
5.6 | 6 | 4.0% | |
4 | 5 | 3.3% | |
4.9 | 5 | 3.3% | |
4.7 | 5 | 3.3% | |
Other values (33) | 73 | 48.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 1 | 0.7% | |
1.1 | 1 | 0.7% | |
1.2 | 2 | 1.3% | |
1.3 | 7 | 4.7% | |
1.4 | 12 | 8.0% |
Maximum 5 values
Value | Count | Frequency (%) | |
6.9 | 1 | 0.7% | |
6.7 | 2 | 1.3% | |
6.6 | 1 | 0.7% | |
6.4 | 1 | 0.7% | |
6.3 | 1 | 0.7% |
petal_width
Highly correlated
This variable is highly correlated with petal_length
and should be ignored for analysis
Correlation | 0.9627570971 |
---|
sepal_length
Numeric
Distinct count | 35 |
---|---|
Unique (%) | 23.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 5.843333333 |
---|---|
Minimum | 4.3 |
Maximum | 7.9 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 4.3 |
---|---|
5-th percentile | 4.6 |
Q1 | 5.1 |
Median | 5.8 |
Q3 | 6.4 |
95-th percentile | 7.255 |
Maximum | 7.9 |
Range | 3.6 |
Interquartile range | 1.3 |
Descriptive statistics
Standard deviation | 0.828066128 |
---|---|
Coef of variation | 0.1417112598 |
Kurtosis | -0.5520640413 |
Mean | 5.843333333 |
MAD | 0.6875555556 |
Skewness | 0.3149109566 |
Sum | 876.5 |
Variance | 0.6856935123 |
Memory size | 1.3 KiB |
Histogram with fixed size bins (bins=35)
Histogram with variable size bins (bins=[4.3 4.75 6.95 7.9 ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
5 | 10 | 6.7% | |
6.3 | 9 | 6.0% | |
5.1 | 9 | 6.0% | |
6.7 | 8 | 5.3% | |
5.7 | 8 | 5.3% | |
5.5 | 7 | 4.7% | |
5.8 | 7 | 4.7% | |
6.4 | 7 | 4.7% | |
6 | 6 | 4.0% | |
4.9 | 6 | 4.0% | |
Other values (25) | 73 | 48.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
4.3 | 1 | 0.7% | |
4.4 | 3 | 2.0% | |
4.5 | 1 | 0.7% | |
4.6 | 4 | 2.7% | |
4.7 | 2 | 1.3% |
Maximum 5 values
Value | Count | Frequency (%) | |
7.9 | 1 | 0.7% | |
7.7 | 4 | 2.7% | |
7.6 | 1 | 0.7% | |
7.4 | 1 | 0.7% | |
7.3 | 1 | 0.7% |
sepal_width
Numeric
Distinct count | 23 |
---|---|
Unique (%) | 15.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 3.054 |
---|---|
Minimum | 2 |
Maximum | 4.4 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2.345 |
Q1 | 2.8 |
Median | 3 |
Q3 | 3.3 |
95-th percentile | 3.8 |
Maximum | 4.4 |
Range | 2.4 |
Interquartile range | 0.5 |
Descriptive statistics
Standard deviation | 0.4335943114 |
---|---|
Coef of variation | 0.1419758714 |
Kurtosis | 0.2907810624 |
Mean | 3.054 |
MAD | 0.3330933333 |
Skewness | 0.3340526622 |
Sum | 458.1 |
Variance | 0.1880040268 |
Memory size | 1.3 KiB |
Histogram with fixed size bins (bins=23)
Histogram with variable size bins (bins=[2. 2.65 3.45 3.85 4.4 ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
3 | 26 | 17.3% | |
2.8 | 14 | 9.3% | |
3.2 | 13 | 8.7% | |
3.4 | 12 | 8.0% | |
3.1 | 12 | 8.0% | |
2.9 | 10 | 6.7% | |
2.7 | 9 | 6.0% | |
2.5 | 8 | 5.3% | |
3.5 | 6 | 4.0% | |
3.8 | 6 | 4.0% | |
Other values (13) | 34 | 22.7% |
Minimum 5 values
Value | Count | Frequency (%) | |
2 | 1 | 0.7% | |
2.2 | 3 | 2.0% | |
2.3 | 4 | 2.7% | |
2.4 | 3 | 2.0% | |
2.5 | 8 | 5.3% |
Maximum 5 values
Value | Count | Frequency (%) | |
4.4 | 1 | 0.7% | |
4.2 | 1 | 0.7% | |
4.1 | 1 | 0.7% | |
4 | 1 | 0.7% | |
3.9 | 2 | 1.3% |
species
Categorical
Distinct count | 3 |
---|---|
Unique (%) | 2.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
virginica | |
---|---|
versicolor | |
setosa |
Value | Count | Frequency (%) | |
virginica | 50 | 33.3% | |
versicolor | 50 | 33.3% | |
setosa | 50 | 33.3% |
Max length | 10 |
---|---|
Mean length | 8.333333333 |
Min length | 6 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
species_2_cat
Categorical
Distinct count | 2 |
---|---|
Unique (%) | 1.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
v | |
---|---|
s |
Value | Count | Frequency (%) | |
v | 100 | 66.7% | |
s | 50 | 33.3% |
Max length | 1 |
---|---|
Mean length | 1 |
Min length | 1 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
Unnamed_0
Numeric
Distinct count | 150 |
---|---|
Unique (%) | 100.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 74.5 |
---|---|
Minimum | 0 |
Maximum | 149 |
Zeros (%) | 0.7% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 7.45 |
Q1 | 37.25 |
Median | 74.5 |
Q3 | 111.75 |
95-th percentile | 141.55 |
Maximum | 149 |
Range | 149 |
Interquartile range | 74.5 |
Descriptive statistics
Standard deviation | 43.44536799 |
---|---|
Coef of variation | 0.5831593019 |
Kurtosis | -1.2 |
Mean | 74.5 |
MAD | 37.5 |
Skewness | 0 |
Sum | 11175 |
Variance | 1887.5 |
Memory size | 1.3 KiB |
Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 0. 149.], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
149 | 1 | 0.7% | |
55 | 1 | 0.7% | |
53 | 1 | 0.7% | |
52 | 1 | 0.7% | |
51 | 1 | 0.7% | |
50 | 1 | 0.7% | |
49 | 1 | 0.7% | |
48 | 1 | 0.7% | |
47 | 1 | 0.7% | |
46 | 1 | 0.7% | |
Other values (140) | 140 | 93.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 1 | 0.7% | |
1 | 1 | 0.7% | |
2 | 1 | 0.7% | |
3 | 1 | 0.7% | |
4 | 1 | 0.7% |
Maximum 5 values
Value | Count | Frequency (%) | |
149 | 1 | 0.7% | |
148 | 1 | 0.7% | |
147 | 1 | 0.7% | |
146 | 1 | 0.7% | |
145 | 1 | 0.7% |
First rows
petal_length | petal_width | sepal_length | sepal_width | species | species_2_cat | Unnamed_0 | |
---|---|---|---|---|---|---|---|
0 | 1.4 | 0.2 | 5.1 | 3.5 | setosa | s | 0 |
1 | 1.4 | 0.2 | 4.9 | 3.0 | setosa | s | 1 |
2 | 1.3 | 0.2 | 4.7 | 3.2 | setosa | s | 2 |
3 | 1.5 | 0.2 | 4.6 | 3.1 | setosa | s | 3 |
4 | 1.4 | 0.2 | 5.0 | 3.6 | setosa | s | 4 |
5 | 1.7 | 0.4 | 5.4 | 3.9 | setosa | s | 5 |
6 | 1.4 | 0.3 | 4.6 | 3.4 | setosa | s | 6 |
7 | 1.5 | 0.2 | 5.0 | 3.4 | setosa | s | 7 |
8 | 1.4 | 0.2 | 4.4 | 2.9 | setosa | s | 8 |
9 | 1.5 | 0.1 | 4.9 | 3.1 | setosa | s | 9 |
Last rows
petal_length | petal_width | sepal_length | sepal_width | species | species_2_cat | Unnamed_0 | |
---|---|---|---|---|---|---|---|
140 | 5.6 | 2.4 | 6.7 | 3.1 | virginica | v | 140 |
141 | 5.1 | 2.3 | 6.9 | 3.1 | virginica | v | 141 |
142 | 5.1 | 1.9 | 5.8 | 2.7 | virginica | v | 142 |
143 | 5.9 | 2.3 | 6.8 | 3.2 | virginica | v | 143 |
144 | 5.7 | 2.5 | 6.7 | 3.3 | virginica | v | 144 |
145 | 5.2 | 2.3 | 6.7 | 3.0 | virginica | v | 145 |
146 | 5.0 | 1.9 | 6.3 | 2.5 | virginica | v | 146 |
147 | 5.2 | 2.0 | 6.5 | 3.0 | virginica | v | 147 |
148 | 5.4 | 2.3 | 6.2 | 3.4 | virginica | v | 148 |
149 | 5.1 | 1.8 | 5.9 | 3.0 | virginica | v | 149 |