Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 271730 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 14.3 MiB |
Average record size in memory | 55.0 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 6 |
Text | 1 |
Numeric | 4 |
Reproduction
Analysis started | 2023-12-29 02:48:48.531344 |
---|---|
Analysis finished | 2023-12-29 02:48:49.755588 |
Duration | 1.22 second |
Software version | ydata-profiling vv4.6.3 |
Download configuration | config.json |
date
Date
Distinct | 202 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 MiB |
Minimum | 2007-01-15 00:00:00 |
---|---|
Maximum | 2023-11-15 00:00:00 |
market
Categorical
HIGH CARDINALITY
 
Distinct | 215 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 540.6 KiB |
National Average | 1663 |
---|---|
Pasar Tenguyun | 1443 |
Pasar Gusher | 1437 |
Pasar Pancasila | 1437 |
Pasar Segiri | 1436 |
Other values (210) |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | National Average |
---|---|
2nd row | National Average |
3rd row | National Average |
4th row | National Average |
5th row | National Average |
Common Values
Value | Count | Frequency (%) |
National Average | 1663 | 0.6% |
Pasar Tenguyun | 1443 | 0.5% |
Pasar Gusher | 1437 | 0.5% |
Pasar Pancasila | 1437 | 0.5% |
Pasar Segiri | 1436 | 0.5% |
Pasar Oeba | 1435 | 0.5% |
Pasar Cikurubuk | 1435 | 0.5% |
Pasar Pelita | 1434 | 0.5% |
Pasar Mandonga | 1433 | 0.5% |
Pasar Kota | 1433 | 0.5% |
Other values (205) | 257144 |
category
Categorical
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 265.8 KiB |
vegetables and fruits | |
---|---|
meat, fish and eggs | |
cereals and tubers | |
oil and fats | |
miscellaneous food | |
Other values (2) | 242 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | cereals and tubers |
---|---|
2nd row | cereals and tubers |
3rd row | meat, fish and eggs |
4th row | meat, fish and eggs |
5th row | meat, fish and eggs |
Common Values
Value | Count | Frequency (%) |
vegetables and fruits | 100694 | |
meat, fish and eggs | 72334 | |
cereals and tubers | 40930 | |
oil and fats | 29639 | 10.9% |
miscellaneous food | 27891 | 10.3% |
milk and dairy | 158 | 0.1% |
non-food | 84 | < 0.1% |
Common Values (Plot)
commodity
Categorical
Distinct | 30 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 266.8 KiB |
Chili (red) | 10994 |
---|---|
Eggs | 10993 |
Oil (vegetable) | 10992 |
Chili (bird's eye) | 10991 |
Sugar | 10990 |
Other values (25) |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Rice |
---|---|
2nd row | Wheat flour |
3rd row | Eggs |
4th row | Meat (beef) |
5th row | Meat (chicken, broiler) |
Common Values
Value | Count | Frequency (%) |
Chili (red) | 10994 | 4.0% |
Eggs | 10993 | 4.0% |
Oil (vegetable) | 10992 | 4.0% |
Chili (bird's eye) | 10991 | 4.0% |
Sugar | 10990 | 4.0% |
Eggs (broiler) | 10838 | 4.0% |
Garlic (medium) | 10838 | 4.0% |
Garlic | 10836 | 4.0% |
Onions (shallot) | 10836 | 4.0% |
Meat (chicken, broiler) | 10821 | 4.0% |
Other values (20) | 162601 |
unit
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 265.6 KiB |
KG | |
---|---|
L | 242 |
385 G | 158 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KG |
---|---|
2nd row | KG |
3rd row | KG |
4th row | KG |
5th row | KG |
Common Values
Value | Count | Frequency (%) |
KG | 271330 | |
L | 242 | 0.1% |
385 G | 158 | 0.1% |
Common Values (Plot)
priceflag
Text
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 MiB |
Value | Count | Frequency (%) |
aggregate | 270067 | |
actual | 1663 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
g | 810201 | |
a | 543460 | |
e | 540134 | |
t | 271730 | 11.1% |
r | 270067 | 11.1% |
c | 1663 | 0.1% |
u | 1663 | 0.1% |
l | 1663 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2440581 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
g | 810201 | |
a | 543460 | |
e | 540134 | |
t | 271730 | 11.1% |
r | 270067 | 11.1% |
c | 1663 | 0.1% |
u | 1663 | 0.1% |
l | 1663 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2440581 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
g | 810201 | |
a | 543460 | |
e | 540134 | |
t | 271730 | 11.1% |
r | 270067 | 11.1% |
c | 1663 | 0.1% |
u | 1663 | 0.1% |
l | 1663 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2440581 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
g | 810201 | |
a | 543460 | |
e | 540134 | |
t | 271730 | 11.1% |
r | 270067 | 11.1% |
c | 1663 | 0.1% |
u | 1663 | 0.1% |
l | 1663 | 0.1% |
pricetype
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 265.6 KiB |
Retail |
---|
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Retail |
---|---|
2nd row | Retail |
3rd row | Retail |
4th row | Retail |
5th row | Retail |
Common Values
Value | Count | Frequency (%) |
Retail | 271730 |
Common Values (Plot)
currency
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 265.6 KiB |
IDR |
---|
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | IDR |
---|---|
2nd row | IDR |
3rd row | IDR |
4th row | IDR |
5th row | IDR |
Common Values
Value | Count | Frequency (%) |
IDR | 271730 |
Common Values (Plot)
price
Real number (ℝ)
Distinct | 51294 |
---|---|
Distinct (%) | 18.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38058.65811 |
Minimum | 1630.65 |
---|---|
Maximum | 215000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 MiB |
Quantile statistics
Minimum | 1630.65 |
---|---|
5-th percentile | 10850 |
Q1 | 14900 |
median | 27050 |
Q3 | 41049.405 |
95-th percentile | 125000 |
Maximum | 215000 |
Range | 213369.35 |
Interquartile range (IQR) | 26149.405 |
Descriptive statistics
Standard deviation | 34198.11955 |
---|---|
Coefficient of variation (CV) | 0.898563461 |
Kurtosis | 2.488586273 |
Mean | 38058.65811 |
Median Absolute Deviation (MAD) | 12550 |
Skewness | 1.844404869 |
Sum | 1.034167917 × 1010 |
Variance | 1169511381 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14000 | 3812 | 1.4% |
120000 | 3513 | 1.3% |
15000 | 3385 | 1.2% |
130000 | 2768 | 1.0% |
13000 | 2480 | 0.9% |
14500 | 2031 | 0.7% |
110000 | 1892 | 0.7% |
13500 | 1889 | 0.7% |
16000 | 1777 | 0.7% |
12000 | 1766 | 0.6% |
Other values (51284) | 246417 |
Value | Count | Frequency (%) |
1630.65 | 2 | < 0.1% |
1750 | 2 | < 0.1% |
1975.81 | 2 | < 0.1% |
2000 | 8 | |
2619.05 | 1 | < 0.1% |
Value | Count | Frequency (%) |
215000 | 1 | |
212879.55 | 2 | |
210833.33 | 1 | |
204404.76 | 1 | |
201708.33 | 1 |
year
Real number (ℝ)
Distinct | 17 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2020.913223 |
Minimum | 2007 |
---|---|
Maximum | 2023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 MiB |
Quantile statistics
Minimum | 2007 |
---|---|
5-th percentile | 2016 |
Q1 | 2020 |
median | 2021 |
Q3 | 2022 |
95-th percentile | 2023 |
Maximum | 2023 |
Range | 16 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 2.006299482 |
---|---|
Coefficient of variation (CV) | 0.0009927687441 |
Kurtosis | 4.09518174 |
Mean | 2020.913223 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -1.689812983 |
Sum | 549142750 |
Variance | 4.025237611 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2022 | 63536 | |
2021 | 63349 | |
2020 | 60714 | |
2023 | 56480 | |
2016 | 22054 | 8.1% |
2017 | 4194 | 1.5% |
2008 | 132 | < 0.1% |
2007 | 132 | < 0.1% |
2012 | 132 | < 0.1% |
2011 | 132 | < 0.1% |
Other values (7) | 875 | 0.3% |
Value | Count | Frequency (%) |
2007 | 132 | |
2008 | 132 | |
2009 | 132 | |
2010 | 132 | |
2011 | 132 |
Value | Count | Frequency (%) |
2023 | 56480 | |
2022 | 63536 | |
2021 | 63349 | |
2020 | 60714 | |
2019 | 120 | < 0.1% |
month
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.585360468 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 7 |
Q3 | 10 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.412121397 |
---|---|
Coefficient of variation (CV) | 0.5181373767 |
Kurtosis | -1.217954182 |
Mean | 6.585360468 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.05698628547 |
Sum | 1789440 |
Variance | 11.64257243 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 25114 | |
11 | 24686 | |
10 | 24570 | |
9 | 24378 | |
8 | 24107 | |
7 | 23962 | |
4 | 21053 | |
5 | 21027 | |
6 | 21017 | |
1 | 20981 | |
Other values (2) | 40835 |
Value | Count | Frequency (%) |
1 | 20981 | |
2 | 20959 | |
3 | 25114 | |
4 | 21053 | |
5 | 21027 |
Value | Count | Frequency (%) |
12 | 19876 | |
11 | 24686 | |
10 | 24570 | |
9 | 24378 | |
8 | 24107 |
day
Real number (ℝ)
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15 |
Minimum | 15 |
---|---|
Maximum | 15 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 MiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 15 |
Q1 | 15 |
median | 15 |
Q3 | 15 |
95-th percentile | 15 |
Maximum | 15 |
Range | 0 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0 |
---|---|
Coefficient of variation (CV) | 0 |
Kurtosis | 0 |
Mean | 15 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0 |
Sum | 4075950 |
Variance | 0 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
15 | 271730 |
Value | Count | Frequency (%) |
15 | 271730 |
Value | Count | Frequency (%) |
15 | 271730 |