Visual and Data Analysis - FIFA 19 Players

  • Created by Andrés Segura Tinoco
  • Created on May 1, 2019

1. Exploration and Profiling

Loading main libraries and data

In [1]:
# Load the Pandas libraries
import pandas as pd
import pandas_profiling as pp
In [2]:
# Read FIFA 19 players raw data
dataURL = "../data/fifa19_overall_data.csv"
data = pd.read_csv(dataURL)

Exploration

In [3]:
# Dataframe rows and columns
data.shape
Out[3]:
(18207, 89)
In [4]:
# Preview the first 5 lines of the loaded data
data.head()
Out[4]:
Order ID Name Age Photo Nationality Flag Overall Potential Club ... Composure Marking StandingTackle SlidingTackle GKDiving GKHandling GKKicking GKPositioning GKReflexes Release Clause €
0 0 158023 L. Messi 31 https://cdn.sofifa.org/players/4/19/158023.png Argentina https://cdn.sofifa.org/flags/52.png 94 94 FC Barcelona ... 96.0 33.0 28.0 26.0 6.0 11.0 15.0 14.0 8.0 226500000.0
1 1 20801 Cristiano Ronaldo 33 https://cdn.sofifa.org/players/4/19/20801.png Portugal https://cdn.sofifa.org/flags/38.png 94 94 Juventus ... 95.0 28.0 31.0 23.0 7.0 11.0 15.0 14.0 11.0 127100000.0
2 2 190871 Neymar Jr 26 https://cdn.sofifa.org/players/4/19/190871.png Brazil https://cdn.sofifa.org/flags/54.png 92 93 Paris Saint-Germain ... 94.0 27.0 24.0 33.0 9.0 9.0 15.0 15.0 11.0 228100000.0
3 3 193080 De Gea 27 https://cdn.sofifa.org/players/4/19/193080.png Spain https://cdn.sofifa.org/flags/45.png 91 93 Manchester United ... 68.0 15.0 21.0 13.0 90.0 85.0 87.0 88.0 94.0 138600000.0
4 4 192985 K. De Bruyne 27 https://cdn.sofifa.org/players/4/19/192985.png Belgium https://cdn.sofifa.org/flags/7.png 91 92 Manchester City ... 88.0 68.0 58.0 51.0 15.0 13.0 5.0 10.0 13.0 196400000.0

5 rows × 89 columns

In [5]:
# Describe dataframe
data.describe()
Out[5]:
Order ID Age Overall Potential Value € Wage € Special International Reputation Weak Foot ... Composure Marking StandingTackle SlidingTackle GKDiving GKHandling GKKicking GKPositioning GKReflexes Release Clause €
count 18207.000000 18207.000000 18207.000000 18207.000000 18207.000000 1.795500e+04 17966.000000 18207.000000 18159.000000 18159.000000 ... 18159.000000 18159.000000 18159.000000 18159.000000 18159.000000 18159.000000 18159.000000 18159.000000 18159.000000 1.664300e+04
mean 9103.000000 214298.338606 25.122206 66.238699 71.307299 2.444530e+06 9861.850161 1597.809908 1.113222 2.947299 ... 58.648274 47.281623 47.697836 45.661435 16.616223 16.391596 16.232061 16.388898 16.710887 4.585061e+06
std 5256.052511 29965.244204 4.669943 6.908930 6.136496 5.626715e+06 22117.273534 272.586016 0.394031 0.660456 ... 11.436133 19.904397 21.664004 21.289135 17.695349 16.906900 16.502864 17.034669 17.955119 1.111872e+07
min 0.000000 16.000000 16.000000 46.000000 48.000000 1.000000e+04 1000.000000 731.000000 1.000000 1.000000 ... 3.000000 3.000000 2.000000 3.000000 1.000000 1.000000 1.000000 1.000000 1.000000 1.300000e+04
25% 4551.500000 200315.500000 21.000000 62.000000 67.000000 3.250000e+05 1000.000000 1457.000000 1.000000 3.000000 ... 51.000000 30.000000 27.000000 24.000000 8.000000 8.000000 8.000000 8.000000 8.000000 5.250000e+05
50% 9103.000000 221759.000000 25.000000 66.000000 71.000000 7.000000e+05 3000.000000 1635.000000 1.000000 3.000000 ... 60.000000 53.000000 55.000000 52.000000 11.000000 11.000000 11.000000 11.000000 11.000000 1.100000e+06
75% 13654.500000 236529.500000 28.000000 71.000000 75.000000 2.100000e+06 9000.000000 1787.000000 1.000000 3.000000 ... 67.000000 64.000000 66.000000 64.000000 14.000000 14.000000 14.000000 14.000000 14.000000 3.500000e+06
max 18206.000000 246620.000000 45.000000 94.000000 95.000000 1.185000e+08 565000.000000 2346.000000 5.000000 5.000000 ... 96.000000 94.000000 93.000000 91.000000 90.000000 92.000000 91.000000 90.000000 94.000000 2.281000e+08

8 rows × 75 columns

Profiling

Generates profile reports from a pandas DataFrame. The pandas df.describe() function is great but a little basic for serious exploratory data analysis. For each column the following statistics - if relevant for the column type - are presented in an interactive HTML report:

  • Essentials: type, unique values, missing values
  • Quantile statistics like minimum value, Q1, median, Q3, maximum, range, interquartile range
  • Descriptive statistics like mean, mode, standard deviation, sum, median absolute deviation, coefficient of variation, kurtosis, skewness
  • Most frequent values
  • Histogram
  • Correlations highlighting of highly correlated variables, Spearman and Pearson matrixes
In [6]:
# Showing data profiling
pp.ProfileReport(data)
Out[6]:

Overview

Dataset info

Number of variables 89
Number of observations 18207
Total Missing (%) 1.3%
Total size in memory 12.4 MiB
Average record size in memory 712.0 B

Variables types

Numeric 38
Categorical 13
Boolean 0
Date 0
Text (Unique) 1
Rejected 37
Unsupported 0

Warnings

  • BallControl is highly correlated with Dribbling (ρ = 0.93894) Rejected
  • CAM is highly correlated with LAM (ρ = 1) Rejected
  • CB is highly correlated with LCB (ρ = 1) Rejected
  • CDM is highly correlated with LDM (ρ = 1) Rejected
  • CF is highly correlated with LF (ρ = 1) Rejected
  • CM is highly correlated with LCM (ρ = 1) Rejected
  • Club has 241 / 1.3% missing values Missing
  • Club has a high cardinality: 652 distinct values Warning
  • Club Logo has a high cardinality: 679 distinct values Warning
  • Contract Valid Until has 289 / 1.6% missing values Missing
  • Dribbling is highly correlated with RM (ρ = 0.94672) Rejected
  • Flag has a high cardinality: 164 distinct values Warning
  • GKHandling is highly correlated with GKDiving (ρ = 0.97028) Rejected
  • GKKicking is highly correlated with GKHandling (ρ = 0.96524) Rejected
  • GKPositioning is highly correlated with GKKicking (ρ = 0.96434) Rejected
  • GKReflexes is highly correlated with GKPositioning (ρ = 0.97013) Rejected
  • Joined has 1553 / 8.5% missing values Missing
  • Joined has a high cardinality: 1737 distinct values Warning
  • LAM is highly correlated with RW (ρ = 0.99688) Rejected
  • LB is highly correlated with RWB (ρ = 0.99618) Rejected
  • LCB is highly correlated with LB (ρ = 0.97727) Rejected
  • LCM is highly correlated with LM (ρ = 0.98259) Rejected
  • LDM is highly correlated with LWB (ρ = 0.98977) Rejected
  • LF is highly correlated with LW (ρ = 0.99736) Rejected
  • LM is highly correlated with RAM (ρ = 0.99658) Rejected
  • LS has 2085 / 11.5% zeros Zeros
  • LW is highly correlated with RS (ρ = 0.98643) Rejected
  • LWB is highly correlated with RM (ρ = 0.90672) Rejected
  • Loaned_From has 16943 / 93.1% missing values Missing
  • Loaned_From has a high cardinality: 342 distinct values Warning
  • Name has a high cardinality: 17194 distinct values Warning
  • Nationality has a high cardinality: 164 distinct values Warning
  • Positioning is highly correlated with RW (ρ = 0.90325) Rejected
  • RAM is highly correlated with CAM (ρ = 1) Rejected
  • RB is highly correlated with RCB (ρ = 0.97727) Rejected
  • RCB is highly correlated with CB (ρ = 1) Rejected
  • RCM is highly correlated with CM (ρ = 1) Rejected
  • RDM is highly correlated with CDM (ρ = 1) Rejected
  • RF is highly correlated with CF (ρ = 1) Rejected
  • RM is highly correlated with RCM (ρ = 0.98259) Rejected
  • RS is highly correlated with ST (ρ = 1) Rejected
  • RW is highly correlated with RF (ρ = 0.99736) Rejected
  • RWB is highly correlated with RDM (ρ = 0.98977) Rejected
  • Release Clause € is highly correlated with Value € (ρ = 0.99435) Rejected
  • ST is highly correlated with LS (ρ = 1) Rejected
  • ShortPassing is highly correlated with RCM (ρ = 0.92691) Rejected
  • SlidingTackle is highly correlated with StandingTackle (ρ = 0.97466) Rejected
  • SprintSpeed is highly correlated with Acceleration (ρ = 0.92193) Rejected
  • StandingTackle is highly correlated with Marking (ρ = 0.90654) Rejected
  • Value € has 252 / 1.4% missing values Missing
  • Wage € has 241 / 1.3% missing values Missing

Variables

Acceleration
Numeric

Distinct count 87
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 64.614
Minimum 12
Maximum 97
Zeros (%) 0.0%

Quantile statistics

Minimum 12
5-th percentile 34
Q1 57
Median 67
Q3 75
95-th percentile 85
Maximum 97
Range 85
Interquartile range 18

Descriptive statistics

Standard deviation 14.928
Coef of variation 0.23103
Kurtosis 0.46964
Mean 64.614
MAD 11.563
Skewness -0.8153
Sum 1173300
Variance 222.84
Memory size 142.3 KiB
Value Count Frequency (%)  
69.0 759 4.2%
 
68.0 741 4.1%
 
67.0 685 3.8%
 
74.0 614 3.4%
 
66.0 608 3.3%
 
65.0 568 3.1%
 
75.0 568 3.1%
 
72.0 549 3.0%
 
78.0 528 2.9%
 
73.0 527 2.9%
 
Other values (76) 12012 66.0%
 

Minimum 5 values

Value Count Frequency (%)  
12.0 1 0.0%
 
13.0 2 0.0%
 
14.0 3 0.0%
 
15.0 17 0.1%
 
16.0 13 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
93.0 41 0.2%
 
94.0 25 0.1%
 
95.0 8 0.0%
 
96.0 2 0.0%
 
97.0 2 0.0%
 

Age
Numeric

Distinct count 29
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 25.122
Minimum 16
Maximum 45
Zeros (%) 0.0%

Quantile statistics

Minimum 16
5-th percentile 18
Q1 21
Median 25
Q3 28
95-th percentile 33
Maximum 45
Range 29
Interquartile range 7

Descriptive statistics

Standard deviation 4.6699
Coef of variation 0.18589
Kurtosis -0.45951
Mean 25.122
MAD 3.8585
Skewness 0.39176
Sum 457400
Variance 21.808
Memory size 142.3 KiB
Value Count Frequency (%)  
21 1423 7.8%
 
26 1387 7.6%
 
24 1358 7.5%
 
22 1340 7.4%
 
23 1332 7.3%
 
25 1319 7.2%
 
20 1240 6.8%
 
27 1162 6.4%
 
28 1101 6.0%
 
19 1024 5.6%
 
Other values (19) 5521 30.3%
 

Minimum 5 values

Value Count Frequency (%)  
16 42 0.2%
 
17 289 1.6%
 
18 732 4.0%
 
19 1024 5.6%
 
20 1240 6.8%
 

Maximum 5 values

Value Count Frequency (%)  
40 13 0.1%
 
41 5 0.0%
 
42 1 0.0%
 
44 2 0.0%
 
45 1 0.0%
 

Aggression
Numeric

Distinct count 86
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 55.869
Minimum 11
Maximum 95
Zeros (%) 0.0%

Quantile statistics

Minimum 11
5-th percentile 24
Q1 44
Median 59
Q3 69
95-th percentile 80
Maximum 95
Range 84
Interquartile range 25

Descriptive statistics

Standard deviation 17.368
Coef of variation 0.31087
Kurtosis -0.63114
Mean 55.869
MAD 14.355
Skewness -0.43217
Sum 1014500
Variance 301.65
Memory size 142.3 KiB
Value Count Frequency (%)  
65.0 516 2.8%
 
68.0 514 2.8%
 
70.0 489 2.7%
 
60.0 464 2.5%
 
58.0 458 2.5%
 
55.0 432 2.4%
 
72.0 427 2.3%
 
62.0 427 2.3%
 
66.0 424 2.3%
 
59.0 421 2.3%
 
Other values (75) 13587 74.6%
 

Minimum 5 values

Value Count Frequency (%)  
11.0 5 0.0%
 
12.0 12 0.1%
 
13.0 17 0.1%
 
14.0 11 0.1%
 
15.0 25 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
91.0 13 0.1%
 
92.0 16 0.1%
 
93.0 3 0.0%
 
94.0 8 0.0%
 
95.0 1 0.0%
 

Agility
Numeric

Distinct count 82
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 63.504
Minimum 14
Maximum 96
Zeros (%) 0.0%

Quantile statistics

Minimum 14
5-th percentile 34
Q1 55
Median 66
Q3 74
95-th percentile 85
Maximum 96
Range 82
Interquartile range 19

Descriptive statistics

Standard deviation 14.766
Coef of variation 0.23252
Kurtosis -0.068599
Mean 63.504
MAD 11.715
Skewness -0.59942
Sum 1153200
Variance 218.04
Memory size 142.3 KiB
Value Count Frequency (%)  
68.0 601 3.3%
 
70.0 568 3.1%
 
72.0 565 3.1%
 
69.0 556 3.1%
 
66.0 550 3.0%
 
71.0 543 3.0%
 
67.0 542 3.0%
 
73.0 536 2.9%
 
74.0 522 2.9%
 
65.0 521 2.9%
 
Other values (71) 12655 69.5%
 

Minimum 5 values

Value Count Frequency (%)  
14.0 1 0.0%
 
15.0 1 0.0%
 
18.0 4 0.0%
 
19.0 6 0.0%
 
20.0 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 64 0.4%
 
93.0 38 0.2%
 
94.0 19 0.1%
 
95.0 4 0.0%
 
96.0 1 0.0%
 

Balance
Numeric

Distinct count 82
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 63.967
Minimum 16
Maximum 96
Zeros (%) 0.0%

Quantile statistics

Minimum 16
5-th percentile 36
Q1 56
Median 66
Q3 74
95-th percentile 84
Maximum 96
Range 80
Interquartile range 18

Descriptive statistics

Standard deviation 14.136
Coef of variation 0.22099
Kurtosis 0.093969
Mean 63.967
MAD 11.154
Skewness -0.58327
Sum 1161600
Variance 199.83
Memory size 142.3 KiB
Value Count Frequency (%)  
68.0 604 3.3%
 
70.0 594 3.3%
 
66.0 589 3.2%
 
65.0 589 3.2%
 
67.0 587 3.2%
 
69.0 574 3.2%
 
71.0 559 3.1%
 
72.0 548 3.0%
 
74.0 534 2.9%
 
73.0 524 2.9%
 
Other values (71) 12457 68.4%
 

Minimum 5 values

Value Count Frequency (%)  
16.0 3 0.0%
 
17.0 3 0.0%
 
18.0 3 0.0%
 
19.0 3 0.0%
 
20.0 7 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 60 0.3%
 
93.0 42 0.2%
 
94.0 23 0.1%
 
95.0 7 0.0%
 
96.0 5 0.0%
 

BallControl
Highly correlated

This variable is highly correlated with Dribbling and should be ignored for analysis

Correlation 0.93894

Body Type
Categorical

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.3%
Missing (n) 48
Normal
10595
Lean
6417
Stocky
 
1140
Other values (7)
 
7
(Missing)
 
48
Value Count Frequency (%)  
Normal 10595 58.2%
 
Lean 6417 35.2%
 
Stocky 1140 6.3%
 
C. Ronaldo 1 0.0%
 
Courtois 1 0.0%
 
Neymar 1 0.0%
 
Shaqiri 1 0.0%
 
Akinfenwa 1 0.0%
 
PLAYER_BODY_TYPE_25 1 0.0%
 
Messi 1 0.0%
 
(Missing) 48 0.3%
 

CAM
Highly correlated

This variable is highly correlated with LAM and should be ignored for analysis

Correlation 1

CB
Highly correlated

This variable is highly correlated with LCB and should be ignored for analysis

Correlation 1

CDM
Highly correlated

This variable is highly correlated with LDM and should be ignored for analysis

Correlation 1

CF
Highly correlated

This variable is highly correlated with LF and should be ignored for analysis

Correlation 1

CM
Highly correlated

This variable is highly correlated with LCM and should be ignored for analysis

Correlation 1

Club
Categorical

Distinct count 652
Unique (%) 3.6%
Missing (%) 1.3%
Missing (n) 241
Empoli
 
33
RC Celta
 
33
Valencia CF
 
33
Other values (648)
17867
(Missing)
 
241
Value Count Frequency (%)  
Empoli 33 0.2%
 
RC Celta 33 0.2%
 
Valencia CF 33 0.2%
 
Everton 33 0.2%
 
Burnley 33 0.2%
 
Tottenham Hotspur 33 0.2%
 
Manchester United 33 0.2%
 
CD Leganés 33 0.2%
 
Borussia Dortmund 33 0.2%
 
Fortuna Düsseldorf 33 0.2%
 
Other values (641) 17636 96.9%
 
(Missing) 241 1.3%
 
Distinct count 679
Unique (%) 3.7%
Missing (%) 0.0%
Missing (n) 0
https://cdn.sofifa.org/teams/2/light/1.png
 
33
https://cdn.sofifa.org/teams/2/light/18.png
 
33
https://cdn.sofifa.org/teams/2/light/10.png
 
33
Other values (676)
18108
Value Count Frequency (%)  
https://cdn.sofifa.org/teams/2/light/1.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/18.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/10.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/243.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/17.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/110.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/480.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/10029.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/1961.png 33 0.2%
 
https://cdn.sofifa.org/teams/2/light/100888.png 33 0.2%
 
Other values (669) 17877 98.2%
 

Composure
Numeric

Distinct count 85
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 58.648
Minimum 3
Maximum 96
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 39
Q1 51
Median 60
Q3 67
95-th percentile 76
Maximum 96
Range 93
Interquartile range 16

Descriptive statistics

Standard deviation 11.436
Coef of variation 0.195
Kurtosis 0.2144
Mean 58.648
MAD 9.0891
Skewness -0.38103
Sum 1065000
Variance 130.79
Memory size 142.3 KiB
Value Count Frequency (%)  
60.0 743 4.1%
 
58.0 698 3.8%
 
62.0 690 3.8%
 
65.0 689 3.8%
 
55.0 664 3.6%
 
64.0 643 3.5%
 
59.0 616 3.4%
 
63.0 601 3.3%
 
68.0 593 3.3%
 
66.0 560 3.1%
 
Other values (74) 11662 64.1%
 

Minimum 5 values

Value Count Frequency (%)  
3.0 1 0.0%
 
12.0 3 0.0%
 
13.0 3 0.0%
 
14.0 1 0.0%
 
15.0 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 1 0.0%
 
93.0 2 0.0%
 
94.0 1 0.0%
 
95.0 1 0.0%
 
96.0 1 0.0%
 

Contract Valid Until
Categorical

Distinct count 37
Unique (%) 0.2%
Missing (%) 1.6%
Missing (n) 289
2019
4819
2021
4360
2020
4027
Other values (33)
4712
Value Count Frequency (%)  
2019 4819 26.5%
 
2021 4360 23.9%
 
2020 4027 22.1%
 
2022 1477 8.1%
 
2023 1053 5.8%
 
30-Jun-19 931 5.1%
 
2018 886 4.9%
 
31-Dec-18 144 0.8%
 
31-May-19 60 0.3%
 
1-Jan-19 51 0.3%
 
Other values (26) 110 0.6%
 
(Missing) 289 1.6%
 

Crossing
Numeric

Distinct count 90
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 49.734
Minimum 5
Maximum 93
Zeros (%) 0.0%

Quantile statistics

Minimum 5
5-th percentile 13
Q1 38
Median 54
Q3 64
95-th percentile 74
Maximum 93
Range 88
Interquartile range 26

Descriptive statistics

Standard deviation 18.365
Coef of variation 0.36925
Kurtosis -0.54331
Mean 49.734
MAD 15.163
Skewness -0.59448
Sum 903120
Variance 337.26
Memory size 142.3 KiB
Value Count Frequency (%)  
65.0 528 2.9%
 
62.0 528 2.9%
 
60.0 518 2.8%
 
59.0 508 2.8%
 
64.0 497 2.7%
 
58.0 491 2.7%
 
63.0 465 2.6%
 
61.0 446 2.4%
 
55.0 431 2.4%
 
66.0 429 2.4%
 
Other values (79) 13318 73.1%
 

Minimum 5 values

Value Count Frequency (%)  
5.0 1 0.0%
 
6.0 3 0.0%
 
7.0 9 0.0%
 
8.0 23 0.1%
 
9.0 39 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
89.0 2 0.0%
 
90.0 3 0.0%
 
91.0 4 0.0%
 
92.0 1 0.0%
 
93.0 1 0.0%
 

Curve
Numeric

Distinct count 90
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 47.171
Minimum 6
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 6
5-th percentile 13
Q1 34
Median 48
Q3 62
95-th percentile 75
Maximum 94
Range 88
Interquartile range 28

Descriptive statistics

Standard deviation 18.395
Coef of variation 0.38997
Kurtosis -0.74397
Mean 47.171
MAD 15.28
Skewness -0.23819
Sum 856580
Variance 338.39
Memory size 142.3 KiB
Value Count Frequency (%)  
58.0 400 2.2%
 
48.0 393 2.2%
 
45.0 391 2.1%
 
60.0 376 2.1%
 
49.0 369 2.0%
 
64.0 368 2.0%
 
55.0 367 2.0%
 
42.0 363 2.0%
 
65.0 361 2.0%
 
59.0 352 1.9%
 
Other values (79) 14419 79.2%
 

Minimum 5 values

Value Count Frequency (%)  
6.0 4 0.0%
 
7.0 4 0.0%
 
8.0 11 0.1%
 
9.0 41 0.2%
 
10.0 147 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 6 0.0%
 
91.0 1 0.0%
 
92.0 1 0.0%
 
93.0 1 0.0%
 
94.0 1 0.0%
 

Dribbling
Highly correlated

This variable is highly correlated with RM and should be ignored for analysis

Correlation 0.94672

FKAccuracy
Numeric

Distinct count 91
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 42.863
Minimum 3
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 13
Q1 31
Median 41
Q3 57
95-th percentile 72
Maximum 94
Range 91
Interquartile range 26

Descriptive statistics

Standard deviation 17.479
Coef of variation 0.40778
Kurtosis -0.71834
Mean 42.863
MAD 14.429
Skewness 0.10212
Sum 778350
Variance 305.51
Memory size 142.3 KiB
Value Count Frequency (%)  
42.0 485 2.7%
 
40.0 479 2.6%
 
32.0 473 2.6%
 
35.0 458 2.5%
 
39.0 441 2.4%
 
34.0 437 2.4%
 
30.0 423 2.3%
 
38.0 407 2.2%
 
31.0 395 2.2%
 
33.0 393 2.2%
 
Other values (80) 13768 75.6%
 

Minimum 5 values

Value Count Frequency (%)  
3.0 1 0.0%
 
4.0 1 0.0%
 
6.0 3 0.0%
 
7.0 8 0.0%
 
8.0 20 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 2 0.0%
 
91.0 1 0.0%
 
92.0 1 0.0%
 
93.0 1 0.0%
 
94.0 1 0.0%
 

Finishing
Numeric

Distinct count 94
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 45.551
Minimum 2
Maximum 95
Zeros (%) 0.0%

Quantile statistics

Minimum 2
5-th percentile 11
Q1 30
Median 49
Q3 62
95-th percentile 73
Maximum 95
Range 93
Interquartile range 32

Descriptive statistics

Standard deviation 19.526
Coef of variation 0.42866
Kurtosis -0.97261
Mean 45.551
MAD 16.772
Skewness -0.30088
Sum 827160
Variance 381.26
Memory size 142.3 KiB
Value Count Frequency (%)  
58.0 462 2.5%
 
59.0 433 2.4%
 
62.0 430 2.4%
 
60.0 427 2.3%
 
65.0 424 2.3%
 
66.0 401 2.2%
 
64.0 399 2.2%
 
57.0 384 2.1%
 
61.0 382 2.1%
 
55.0 374 2.1%
 
Other values (83) 14043 77.1%
 

Minimum 5 values

Value Count Frequency (%)  
2.0 2 0.0%
 
4.0 5 0.0%
 
5.0 80 0.4%
 
6.0 95 0.5%
 
7.0 115 0.6%
 

Maximum 5 values

Value Count Frequency (%)  
91.0 2 0.0%
 
92.0 1 0.0%
 
93.0 2 0.0%
 
94.0 2 0.0%
 
95.0 1 0.0%
 

Flag
Categorical

Distinct count 164
Unique (%) 0.9%
Missing (%) 0.0%
Missing (n) 0
https://cdn.sofifa.org/flags/14.png
 
1662
https://cdn.sofifa.org/flags/21.png
 
1198
https://cdn.sofifa.org/flags/45.png
 
1072
Other values (161)
14275
Value Count Frequency (%)  
https://cdn.sofifa.org/flags/14.png 1662 9.1%
 
https://cdn.sofifa.org/flags/21.png 1198 6.6%
 
https://cdn.sofifa.org/flags/45.png 1072 5.9%
 
https://cdn.sofifa.org/flags/52.png 937 5.1%
 
https://cdn.sofifa.org/flags/18.png 914 5.0%
 
https://cdn.sofifa.org/flags/54.png 827 4.5%
 
https://cdn.sofifa.org/flags/27.png 702 3.9%
 
https://cdn.sofifa.org/flags/56.png 618 3.4%
 
https://cdn.sofifa.org/flags/163.png 478 2.6%
 
https://cdn.sofifa.org/flags/34.png 453 2.5%
 
Other values (154) 9346 51.3%
 

GKDiving
Numeric

Distinct count 72
Unique (%) 0.4%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 16.616
Minimum 1
Maximum 90
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 6
Q1 8
Median 11
Q3 14
95-th percentile 66
Maximum 90
Range 89
Interquartile range 6

Descriptive statistics

Standard deviation 17.695
Coef of variation 1.0649
Kurtosis 4.4459
Mean 16.616
MAD 10.878
Skewness 2.441
Sum 301730
Variance 313.13
Memory size 142.3 KiB
Value Count Frequency (%)  
8.0 1617 8.9%
 
9.0 1586 8.7%
 
7.0 1581 8.7%
 
12.0 1575 8.7%
 
14.0 1564 8.6%
 
10.0 1563 8.6%
 
13.0 1551 8.5%
 
11.0 1500 8.2%
 
6.0 1305 7.2%
 
15.0 1236 6.8%
 
Other values (61) 3081 16.9%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 27 0.1%
 
2.0 6 0.0%
 
3.0 19 0.1%
 
4.0 27 0.1%
 
5.0 388 2.1%
 

Maximum 5 values

Value Count Frequency (%)  
85.0 8 0.0%
 
86.0 4 0.0%
 
87.0 3 0.0%
 
88.0 3 0.0%
 
90.0 3 0.0%
 

GKHandling
Highly correlated

This variable is highly correlated with GKDiving and should be ignored for analysis

Correlation 0.97028

GKKicking
Highly correlated

This variable is highly correlated with GKHandling and should be ignored for analysis

Correlation 0.96524

GKPositioning
Highly correlated

This variable is highly correlated with GKKicking and should be ignored for analysis

Correlation 0.96434

GKReflexes
Highly correlated

This variable is highly correlated with GKPositioning and should be ignored for analysis

Correlation 0.97013

HeadingAccuracy
Numeric

Distinct count 92
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 52.298
Minimum 4
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 4
5-th percentile 13
Q1 44
Median 56
Q3 64
95-th percentile 75
Maximum 94
Range 90
Interquartile range 20

Descriptive statistics

Standard deviation 17.38
Coef of variation 0.33232
Kurtosis 0.29251
Mean 52.298
MAD 13.397
Skewness -0.88513
Sum 949680
Variance 302.06
Memory size 142.3 KiB
Value Count Frequency (%)  
58.0 646 3.5%
 
59.0 572 3.1%
 
60.0 571 3.1%
 
55.0 565 3.1%
 
62.0 528 2.9%
 
65.0 510 2.8%
 
61.0 485 2.7%
 
64.0 483 2.7%
 
54.0 481 2.6%
 
57.0 471 2.6%
 
Other values (81) 12847 70.6%
 

Minimum 5 values

Value Count Frequency (%)  
4.0 2 0.0%
 
5.0 3 0.0%
 
6.0 2 0.0%
 
7.0 7 0.0%
 
8.0 20 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 6 0.0%
 
91.0 4 0.0%
 
92.0 3 0.0%
 
93.0 3 0.0%
 
94.0 2 0.0%
 

HeightMts
Numeric

Distinct count 22
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.8076
Minimum 0
Maximum 2.06
Zeros (%) 0.3%

Quantile statistics

Minimum 0
5-th percentile 1.7
Q1 1.75
Median 1.8
Q3 1.85
95-th percentile 1.93
Maximum 2.06
Range 2.06
Interquartile range 0.1

Descriptive statistics

Standard deviation 0.11491
Coef of variation 0.063572
Kurtosis 158.81
Mean 1.8076
MAD 0.059886
Skewness -10.215
Sum 32910
Variance 0.013204
Memory size 142.3 KiB
Value Count Frequency (%)  
1.83 2881 15.8%
 
1.78 2479 13.6%
 
1.75 2238 12.3%
 
1.8 2159 11.9%
 
1.88 2015 11.1%
 
1.85 1908 10.5%
 
1.91 990 5.4%
 
1.73 946 5.2%
 
1.7 905 5.0%
 
1.93 749 4.1%
 
Other values (12) 937 5.1%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 48 0.3%
 
1.55 3 0.0%
 
1.57 5 0.0%
 
1.6 18 0.1%
 
1.63 30 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
1.96 246 1.4%
 
1.98 93 0.5%
 
2.01 21 0.1%
 
2.03 10 0.1%
 
2.06 2 0.0%
 

ID
Numeric

Distinct count 18207
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 214300
Minimum 16
Maximum 246620
Zeros (%) 0.0%

Quantile statistics

Minimum 16
5-th percentile 163960
Q1 200320
Median 221760
Q3 236530
95-th percentile 244740
Maximum 246620
Range 246604
Interquartile range 36214

Descriptive statistics

Standard deviation 29965
Coef of variation 0.13983
Kurtosis 9.6095
Mean 214300
MAD 21992
Skewness -2.268
Sum 3901729851
Variance 897920000
Memory size 142.3 KiB
Value Count Frequency (%)  
231423 1 0.0%
 
225883 1 0.0%
 
205393 1 0.0%
 
244306 1 0.0%
 
209491 1 0.0%
 
229973 1 0.0%
 
201303 1 0.0%
 
243259 1 0.0%
 
215644 1 0.0%
 
213837 1 0.0%
 
Other values (18197) 18197 99.9%
 

Minimum 5 values

Value Count Frequency (%)  
16 1 0.0%
 
41 1 0.0%
 
80 1 0.0%
 
164 1 0.0%
 
657 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
246609 1 0.0%
 
246613 1 0.0%
 
246616 1 0.0%
 
246617 1 0.0%
 
246620 1 0.0%
 

Interceptions
Numeric

Distinct count 90
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 46.698
Minimum 3
Maximum 92
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 13
Q1 26
Median 52
Q3 64
95-th percentile 75
Maximum 92
Range 89
Interquartile range 38

Descriptive statistics

Standard deviation 20.697
Coef of variation 0.4432
Kurtosis -1.2643
Mean 46.698
MAD 18.38
Skewness -0.26839
Sum 847990
Variance 428.36
Memory size 142.3 KiB
Value Count Frequency (%)  
62.0 509 2.8%
 
66.0 505 2.8%
 
65.0 493 2.7%
 
63.0 477 2.6%
 
64.0 465 2.6%
 
60.0 428 2.4%
 
68.0 408 2.2%
 
58.0 402 2.2%
 
67.0 389 2.1%
 
22.0 373 2.0%
 
Other values (79) 13710 75.3%
 

Minimum 5 values

Value Count Frequency (%)  
3.0 1 0.0%
 
4.0 1 0.0%
 
5.0 6 0.0%
 
6.0 46 0.3%
 
7.0 46 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
87.0 12 0.1%
 
88.0 5 0.0%
 
89.0 3 0.0%
 
90.0 3 0.0%
 
92.0 2 0.0%
 

International Reputation
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1132
Minimum 1
Maximum 5
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 1
Q1 1
Median 1
Q3 1
95-th percentile 2
Maximum 5
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.39403
Coef of variation 0.35396
Kurtosis 18.961
Mean 1.1132
MAD 0.20616
Skewness 4.0604
Sum 20215
Variance 0.15526
Memory size 142.3 KiB
Value Count Frequency (%)  
1.0 16532 90.8%
 
2.0 1261 6.9%
 
3.0 309 1.7%
 
4.0 51 0.3%
 
5.0 6 0.0%
 
(Missing) 48 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 16532 90.8%
 
2.0 1261 6.9%
 
3.0 309 1.7%
 
4.0 51 0.3%
 
5.0 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1.0 16532 90.8%
 
2.0 1261 6.9%
 
3.0 309 1.7%
 
4.0 51 0.3%
 
5.0 6 0.0%
 

Jersey Number
Numeric

Distinct count 100
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 60
Infinite (%) 0.0%
Infinite (n) 0
Mean 19.546
Minimum 1
Maximum 99
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 2
Q1 8
Median 17
Q3 26
95-th percentile 44
Maximum 99
Range 98
Interquartile range 18

Descriptive statistics

Standard deviation 15.948
Coef of variation 0.81591
Kurtosis 7.1353
Mean 19.546
MAD 11.099
Skewness 2.1855
Sum 354700
Variance 254.33
Memory size 142.3 KiB
Value Count Frequency (%)  
8.0 612 3.4%
 
7.0 604 3.3%
 
10.0 593 3.3%
 
11.0 590 3.2%
 
6.0 586 3.2%
 
5.0 579 3.2%
 
9.0 577 3.2%
 
4.0 573 3.1%
 
20.0 568 3.1%
 
1.0 566 3.1%
 
Other values (89) 12299 67.6%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 566 3.1%
 
2.0 519 2.9%
 
3.0 547 3.0%
 
4.0 573 3.1%
 
5.0 579 3.2%
 

Maximum 5 values

Value Count Frequency (%)  
95.0 12 0.1%
 
96.0 13 0.1%
 
97.0 16 0.1%
 
98.0 21 0.1%
 
99.0 70 0.4%
 

Joined
Categorical

Distinct count 1737
Unique (%) 9.5%
Missing (%) 8.5%
Missing (n) 1553
1-Jul-18
 
1538
1-Jul-17
 
1133
1-Jan-18
 
635
Other values (1733)
13348
(Missing)
 
1553
Value Count Frequency (%)  
1-Jul-18 1538 8.4%
 
1-Jul-17 1133 6.2%
 
1-Jan-18 635 3.5%
 
1-Jul-16 614 3.4%
 
1-Jul-15 368 2.0%
 
1-Jan-17 231 1.3%
 
1-Jul-14 226 1.2%
 
1-Jan-16 180 1.0%
 
1-Jul-13 156 0.9%
 
1-Jan-15 143 0.8%
 
Other values (1726) 11430 62.8%
 
(Missing) 1553 8.5%
 

Jumping
Numeric

Distinct count 75
Unique (%) 0.4%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 65.089
Minimum 15
Maximum 95
Zeros (%) 0.0%

Quantile statistics

Minimum 15
5-th percentile 43
Q1 58
Median 66
Q3 73
95-th percentile 83
Maximum 95
Range 80
Interquartile range 15

Descriptive statistics

Standard deviation 11.82
Coef of variation 0.1816
Kurtosis 0.32485
Mean 65.089
MAD 9.268
Skewness -0.45432
Sum 1182000
Variance 139.71
Memory size 142.3 KiB
Value Count Frequency (%)  
70.0 718 3.9%
 
72.0 651 3.6%
 
68.0 647 3.6%
 
71.0 631 3.5%
 
65.0 626 3.4%
 
63.0 624 3.4%
 
67.0 621 3.4%
 
73.0 616 3.4%
 
64.0 600 3.3%
 
69.0 593 3.3%
 
Other values (64) 11832 65.0%
 

Minimum 5 values

Value Count Frequency (%)  
15.0 1 0.0%
 
16.0 1 0.0%
 
22.0 1 0.0%
 
23.0 1 0.0%
 
26.0 4 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
91.0 75 0.4%
 
92.0 45 0.2%
 
93.0 18 0.1%
 
94.0 5 0.0%
 
95.0 1 0.0%
 

LAM
Highly correlated

This variable is highly correlated with RW and should be ignored for analysis

Correlation 0.99688

LB
Highly correlated

This variable is highly correlated with RWB and should be ignored for analysis

Correlation 0.99618

LCB
Highly correlated

This variable is highly correlated with LB and should be ignored for analysis

Correlation 0.97727

LCM
Highly correlated

This variable is highly correlated with LM and should be ignored for analysis

Correlation 0.98259

LDM
Highly correlated

This variable is highly correlated with LWB and should be ignored for analysis

Correlation 0.98977

LF
Highly correlated

This variable is highly correlated with LW and should be ignored for analysis

Correlation 0.99736

LM
Highly correlated

This variable is highly correlated with RAM and should be ignored for analysis

Correlation 0.99658

LS
Numeric

Distinct count 60
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 51.195
Minimum 0
Maximum 91
Zeros (%) 11.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 48
Median 57
Q3 64
95-th percentile 71
Maximum 91
Range 91
Interquartile range 16

Descriptive statistics

Standard deviation 20.308
Coef of variation 0.39669
Kurtosis 1.8273
Mean 51.195
MAD 14.15
Skewness -1.643
Sum 932101
Variance 412.43
Memory size 142.3 KiB
Value Count Frequency (%)  
0 2085 11.5%
 
61 714 3.9%
 
60 698 3.8%
 
59 694 3.8%
 
63 673 3.7%
 
64 671 3.7%
 
58 666 3.7%
 
57 654 3.6%
 
62 628 3.4%
 
56 604 3.3%
 
Other values (50) 10120 55.6%
 

Minimum 5 values

Value Count Frequency (%)  
0 2085 11.5%
 
31 2 0.0%
 
32 8 0.0%
 
33 12 0.1%
 
34 33 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
85 3 0.0%
 
86 4 0.0%
 
87 2 0.0%
 
88 1 0.0%
 
91 1 0.0%
 

LW
Highly correlated

This variable is highly correlated with RS and should be ignored for analysis

Correlation 0.98643

LWB
Highly correlated

This variable is highly correlated with RM and should be ignored for analysis

Correlation 0.90672

Loaned_From
Categorical

Distinct count 342
Unique (%) 1.9%
Missing (%) 93.1%
Missing (n) 16943
Atalanta
 
20
Sassuolo
 
18
SL Benfica
 
17
Other values (338)
 
1209
(Missing)
16943
Value Count Frequency (%)  
Atalanta 20 0.1%
 
Sassuolo 18 0.1%
 
SL Benfica 17 0.1%
 
Juventus 17 0.1%
 
Sporting CP 14 0.1%
 
AS Monaco 12 0.1%
 
FC Red Bull Salzburg 12 0.1%
 
Real Madrid 12 0.1%
 
Liverpool 12 0.1%
 
Manchester City 12 0.1%
 
Other values (331) 1118 6.1%
 
(Missing) 16943 93.1%
 

LongPassing
Numeric

Distinct count 85
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 52.712
Minimum 9
Maximum 93
Zeros (%) 0.0%

Quantile statistics

Minimum 9
5-th percentile 23
Q1 43
Median 56
Q3 64
95-th percentile 74
Maximum 93
Range 84
Interquartile range 21

Descriptive statistics

Standard deviation 15.328
Coef of variation 0.29079
Kurtosis -0.37654
Mean 52.712
MAD 12.475
Skewness -0.59564
Sum 957200
Variance 234.94
Memory size 142.3 KiB
Value Count Frequency (%)  
62.0 625 3.4%
 
59.0 603 3.3%
 
58.0 602 3.3%
 
63.0 598 3.3%
 
60.0 593 3.3%
 
64.0 565 3.1%
 
65.0 557 3.1%
 
55.0 537 2.9%
 
57.0 510 2.8%
 
61.0 508 2.8%
 
Other values (74) 12461 68.4%
 

Minimum 5 values

Value Count Frequency (%)  
9.0 3 0.0%
 
10.0 3 0.0%
 
11.0 11 0.1%
 
12.0 18 0.1%
 
13.0 22 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
88.0 2 0.0%
 
89.0 4 0.0%
 
90.0 2 0.0%
 
91.0 1 0.0%
 
93.0 1 0.0%
 

LongShots
Numeric

Distinct count 93
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 47.11
Minimum 3
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 12
Q1 33
Median 51
Q3 62
95-th percentile 73
Maximum 94
Range 91
Interquartile range 29

Descriptive statistics

Standard deviation 19.261
Coef of variation 0.40884
Kurtosis -0.82051
Mean 47.11
MAD 16.267
Skewness -0.43227
Sum 855470
Variance 370.97
Memory size 142.3 KiB
Value Count Frequency (%)  
59.0 508 2.8%
 
58.0 496 2.7%
 
62.0 485 2.7%
 
65.0 436 2.4%
 
55.0 435 2.4%
 
64.0 430 2.4%
 
60.0 408 2.2%
 
56.0 401 2.2%
 
52.0 395 2.2%
 
63.0 385 2.1%
 
Other values (82) 13780 75.7%
 

Minimum 5 values

Value Count Frequency (%)  
3.0 1 0.0%
 
4.0 6 0.0%
 
5.0 84 0.5%
 
6.0 110 0.6%
 
7.0 107 0.6%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 2 0.0%
 
91.0 2 0.0%
 
92.0 2 0.0%
 
93.0 2 0.0%
 
94.0 1 0.0%
 

Marking
Numeric

Distinct count 93
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 47.282
Minimum 3
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 13
Q1 30
Median 53
Q3 64
95-th percentile 74
Maximum 94
Range 91
Interquartile range 34

Descriptive statistics

Standard deviation 19.904
Coef of variation 0.42098
Kurtosis -1.0924
Mean 47.282
MAD 17.399
Skewness -0.36268
Sum 858590
Variance 396.19
Memory size 142.3 KiB
Value Count Frequency (%)  
60.0 573 3.1%
 
62.0 549 3.0%
 
65.0 525 2.9%
 
64.0 485 2.7%
 
58.0 466 2.6%
 
55.0 453 2.5%
 
63.0 428 2.4%
 
66.0 427 2.3%
 
59.0 424 2.3%
 
68.0 420 2.3%
 
Other values (82) 13409 73.6%
 

Minimum 5 values

Value Count Frequency (%)  
3.0 1 0.0%
 
4.0 1 0.0%
 
5.0 16 0.1%
 
6.0 29 0.2%
 
7.0 53 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 9 0.0%
 
91.0 2 0.0%
 
92.0 1 0.0%
 
93.0 1 0.0%
 
94.0 1 0.0%
 

Name
Categorical

Distinct count 17194
Unique (%) 94.4%
Missing (%) 0.0%
Missing (n) 0
J. Rodríguez
 
11
Paulinho
 
8
R. Williams
 
7
Other values (17191)
18181
Value Count Frequency (%)  
J. Rodríguez 11 0.1%
 
Paulinho 8 0.0%
 
R. Williams 7 0.0%
 
J. Williams 7 0.0%
 
J. Valencia 6 0.0%
 
Felipe 6 0.0%
 
J. Gómez 6 0.0%
 
J. Hernández 6 0.0%
 
J. Jones 5 0.0%
 
J. Murillo 5 0.0%
 
Other values (17184) 18140 99.6%
 

Nationality
Categorical

Distinct count 164
Unique (%) 0.9%
Missing (%) 0.0%
Missing (n) 0
England
 
1662
Germany
 
1198
Spain
 
1072
Other values (161)
14275
Value Count Frequency (%)  
England 1662 9.1%
 
Germany 1198 6.6%
 
Spain 1072 5.9%
 
Argentina 937 5.1%
 
France 914 5.0%
 
Brazil 827 4.5%
 
Italy 702 3.9%
 
Colombia 618 3.4%
 
Japan 478 2.6%
 
Netherlands 453 2.5%
 
Other values (154) 9346 51.3%
 

Order
Numeric

Distinct count 18207
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 9103
Minimum 0
Maximum 18206
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 910.3
Q1 4551.5
Median 9103
Q3 13654
95-th percentile 17296
Maximum 18206
Range 18206
Interquartile range 9103

Descriptive statistics

Standard deviation 5256.1
Coef of variation 0.5774
Kurtosis -1.2
Mean 9103
MAD 4551.7
Skewness 0
Sum 165738321
Variance 27626000
Memory size 142.3 KiB
Value Count Frequency (%)  
2047 1 0.0%
 
15010 1 0.0%
 
661 1 0.0%
 
6806 1 0.0%
 
4759 1 0.0%
 
17053 1 0.0%
 
10912 1 0.0%
 
8865 1 0.0%
 
12963 1 0.0%
 
12947 1 0.0%
 
Other values (18197) 18197 99.9%
 

Minimum 5 values

Value Count Frequency (%)  
0 1 0.0%
 
1 1 0.0%
 
2 1 0.0%
 
3 1 0.0%
 
4 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
18202 1 0.0%
 
18203 1 0.0%
 
18204 1 0.0%
 
18205 1 0.0%
 
18206 1 0.0%
 

Overall
Numeric

Distinct count 48
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 66.239
Minimum 46
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 46
5-th percentile 54
Q1 62
Median 66
Q3 71
95-th percentile 77
Maximum 94
Range 48
Interquartile range 9

Descriptive statistics

Standard deviation 6.9089
Coef of variation 0.1043
Kurtosis 0.096569
Mean 66.239
MAD 5.4443
Skewness 0.067185
Sum 1206008
Variance 47.733
Memory size 142.3 KiB
Value Count Frequency (%)  
66 1163 6.4%
 
67 1118 6.1%
 
64 1091 6.0%
 
65 1045 5.7%
 
68 1035 5.7%
 
63 1002 5.5%
 
69 973 5.3%
 
70 889 4.9%
 
62 878 4.8%
 
71 783 4.3%
 
Other values (38) 8230 45.2%
 

Minimum 5 values

Value Count Frequency (%)  
46 1 0.0%
 
47 20 0.1%
 
48 32 0.2%
 
49 36 0.2%
 
50 103 0.6%
 

Maximum 5 values

Value Count Frequency (%)  
89 11 0.1%
 
90 5 0.0%
 
91 6 0.0%
 
92 1 0.0%
 
94 2 0.0%
 

Penalties
Numeric

Distinct count 88
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 48.549
Minimum 5
Maximum 92
Zeros (%) 0.0%

Quantile statistics

Minimum 5
5-th percentile 18
Q1 39
Median 49
Q3 60
95-th percentile 72
Maximum 92
Range 87
Interquartile range 21

Descriptive statistics

Standard deviation 15.704
Coef of variation 0.32347
Kurtosis -0.34735
Mean 48.549
MAD 12.708
Skewness -0.35497
Sum 881590
Variance 246.62
Memory size 142.3 KiB
Value Count Frequency (%)  
45.0 492 2.7%
 
55.0 488 2.7%
 
49.0 480 2.6%
 
48.0 473 2.6%
 
58.0 471 2.6%
 
59.0 468 2.6%
 
60.0 452 2.5%
 
50.0 444 2.4%
 
44.0 426 2.3%
 
42.0 420 2.3%
 
Other values (77) 13545 74.4%
 

Minimum 5 values

Value Count Frequency (%)  
5.0 1 0.0%
 
7.0 1 0.0%
 
8.0 4 0.0%
 
9.0 6 0.0%
 
10.0 52 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
88.0 7 0.0%
 
89.0 3 0.0%
 
90.0 6 0.0%
 
91.0 1 0.0%
 
92.0 1 0.0%
 

Photo
Categorical, Unique

First 3 values
Last 3 values

First 10 values

Value Count Frequency (%)  
https://cdn.sofifa.org/players/4/19/100803.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/100899.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/101317.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/101473.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/101488.png 1 0.0%
 

Last 10 values

Value Count Frequency (%)  
https://cdn.sofifa.org/players/4/19/768.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/80.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/8205.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/9014.png 1 0.0%
 
https://cdn.sofifa.org/players/4/19/9833.png 1 0.0%
 

Position
Categorical

Distinct count 28
Unique (%) 0.2%
Missing (%) 0.3%
Missing (n) 60
ST
 
2152
GK
 
2025
CB
 
1778
Other values (24)
12192
Value Count Frequency (%)  
ST 2152 11.8%
 
GK 2025 11.1%
 
CB 1778 9.8%
 
CM 1394 7.7%
 
LB 1322 7.3%
 
RB 1291 7.1%
 
RM 1124 6.2%
 
LM 1095 6.0%
 
CAM 958 5.3%
 
CDM 948 5.2%
 
Other values (17) 4060 22.3%
 

Positioning
Highly correlated

This variable is highly correlated with RW and should be ignored for analysis

Correlation 0.90325

Potential
Numeric

Distinct count 47
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 71.307
Minimum 48
Maximum 95
Zeros (%) 0.0%

Quantile statistics

Minimum 48
5-th percentile 62
Q1 67
Median 71
Q3 75
95-th percentile 82
Maximum 95
Range 47
Interquartile range 8

Descriptive statistics

Standard deviation 6.1365
Coef of variation 0.086057
Kurtosis 0.035826
Mean 71.307
MAD 4.9083
Skewness 0.26615
Sum 1298292
Variance 37.657
Memory size 142.3 KiB
Value Count Frequency (%)  
70 1203 6.6%
 
69 1175 6.5%
 
71 1140 6.3%
 
68 1136 6.2%
 
72 1122 6.2%
 
73 1051 5.8%
 
74 1015 5.6%
 
66 996 5.5%
 
67 991 5.4%
 
75 951 5.2%
 
Other values (37) 7427 40.8%
 

Minimum 5 values

Value Count Frequency (%)  
48 2 0.0%
 
50 2 0.0%
 
51 2 0.0%
 
52 10 0.1%
 
53 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
91 12 0.1%
 
92 9 0.0%
 
93 4 0.0%
 
94 3 0.0%
 
95 1 0.0%
 

Preferred Foot
Categorical

Distinct count 3
Unique (%) 0.0%
Missing (%) 0.3%
Missing (n) 48
Right
13948
Left
4211
(Missing)
 
48
Value Count Frequency (%)  
Right 13948 76.6%
 
Left 4211 23.1%
 
(Missing) 48 0.3%
 

RAM
Highly correlated

This variable is highly correlated with CAM and should be ignored for analysis

Correlation 1

RB
Highly correlated

This variable is highly correlated with RCB and should be ignored for analysis

Correlation 0.97727

RCB
Highly correlated

This variable is highly correlated with CB and should be ignored for analysis

Correlation 1

RCM
Highly correlated

This variable is highly correlated with CM and should be ignored for analysis

Correlation 1

RDM
Highly correlated

This variable is highly correlated with CDM and should be ignored for analysis

Correlation 1

RF
Highly correlated

This variable is highly correlated with CF and should be ignored for analysis

Correlation 1

RM
Highly correlated

This variable is highly correlated with RCM and should be ignored for analysis

Correlation 0.98259

RS
Highly correlated

This variable is highly correlated with ST and should be ignored for analysis

Correlation 1

RW
Highly correlated

This variable is highly correlated with RF and should be ignored for analysis

Correlation 0.99736

RWB
Highly correlated

This variable is highly correlated with RDM and should be ignored for analysis

Correlation 0.98977

Reactions
Numeric

Distinct count 69
Unique (%) 0.4%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 61.837
Minimum 21
Maximum 96
Zeros (%) 0.0%

Quantile statistics

Minimum 21
5-th percentile 47
Q1 56
Median 62
Q3 68
95-th percentile 76
Maximum 96
Range 75
Interquartile range 12

Descriptive statistics

Standard deviation 9.0105
Coef of variation 0.14571
Kurtosis 0.057029
Mean 61.837
MAD 7.1723
Skewness -0.11413
Sum 1122900
Variance 81.188
Memory size 142.3 KiB
Value Count Frequency (%)  
65.0 863 4.7%
 
64.0 862 4.7%
 
60.0 833 4.6%
 
62.0 820 4.5%
 
63.0 812 4.5%
 
66.0 734 4.0%
 
58.0 716 3.9%
 
68.0 710 3.9%
 
67.0 701 3.9%
 
61.0 700 3.8%
 
Other values (58) 10408 57.2%
 

Minimum 5 values

Value Count Frequency (%)  
21.0 1 0.0%
 
30.0 5 0.0%
 
31.0 5 0.0%
 
32.0 6 0.0%
 
33.0 12 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 1 0.0%
 
93.0 1 0.0%
 
94.0 1 0.0%
 
95.0 1 0.0%
 
96.0 1 0.0%
 

Real Face
Categorical

Distinct count 3
Unique (%) 0.0%
Missing (%) 0.3%
Missing (n) 48
No
16505
Yes
 
1654
(Missing)
 
48
Value Count Frequency (%)  
No 16505 90.7%
 
Yes 1654 9.1%
 
(Missing) 48 0.3%
 

Release Clause €
Highly correlated

This variable is highly correlated with Value € and should be ignored for analysis

Correlation 0.99435

ST
Highly correlated

This variable is highly correlated with LS and should be ignored for analysis

Correlation 1

ShortPassing
Highly correlated

This variable is highly correlated with RCM and should be ignored for analysis

Correlation 0.92691

ShotPower
Numeric

Distinct count 93
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 55.46
Minimum 2
Maximum 95
Zeros (%) 0.0%

Quantile statistics

Minimum 2
5-th percentile 21
Q1 45
Median 59
Q3 68
95-th percentile 78
Maximum 95
Range 93
Interquartile range 23

Descriptive statistics

Standard deviation 17.238
Coef of variation 0.31082
Kurtosis -0.33517
Mean 55.46
MAD 13.951
Skewness -0.68047
Sum 1007100
Variance 297.15
Memory size 142.3 KiB
Value Count Frequency (%)  
68.0 574 3.2%
 
70.0 558 3.1%
 
65.0 552 3.0%
 
62.0 542 3.0%
 
66.0 515 2.8%
 
60.0 497 2.7%
 
64.0 491 2.7%
 
63.0 473 2.6%
 
58.0 471 2.6%
 
59.0 462 2.5%
 
Other values (82) 13024 71.5%
 

Minimum 5 values

Value Count Frequency (%)  
2.0 1 0.0%
 
3.0 1 0.0%
 
6.0 2 0.0%
 
7.0 1 0.0%
 
8.0 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
91.0 2 0.0%
 
92.0 3 0.0%
 
93.0 1 0.0%
 
94.0 1 0.0%
 
95.0 1 0.0%
 

Skill Moves
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.3613
Minimum 1
Maximum 5
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 1
Q1 2
Median 2
Q3 3
95-th percentile 4
Maximum 5
Range 4
Interquartile range 1

Descriptive statistics

Standard deviation 0.75616
Coef of variation 0.32023
Kurtosis -0.087602
Mean 2.3613
MAD 0.6446
Skewness 0.1499
Sum 42879
Variance 0.57178
Memory size 142.3 KiB
Value Count Frequency (%)  
2.0 8565 47.0%
 
3.0 6600 36.2%
 
1.0 2026 11.1%
 
4.0 917 5.0%
 
5.0 51 0.3%
 
(Missing) 48 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 2026 11.1%
 
2.0 8565 47.0%
 
3.0 6600 36.2%
 
4.0 917 5.0%
 
5.0 51 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
1.0 2026 11.1%
 
2.0 8565 47.0%
 
3.0 6600 36.2%
 
4.0 917 5.0%
 
5.0 51 0.3%
 

SlidingTackle
Highly correlated

This variable is highly correlated with StandingTackle and should be ignored for analysis

Correlation 0.97466

Special
Numeric

Distinct count 1422
Unique (%) 7.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1597.8
Minimum 731
Maximum 2346
Zeros (%) 0.0%

Quantile statistics

Minimum 731
5-th percentile 1043
Q1 1457
Median 1635
Q3 1787
95-th percentile 1977
Maximum 2346
Range 1615
Interquartile range 330

Descriptive statistics

Standard deviation 272.59
Coef of variation 0.1706
Kurtosis 0.33808
Mean 1597.8
MAD 211.76
Skewness -0.7058
Sum 29091325
Variance 74303
Memory size 142.3 KiB
Value Count Frequency (%)  
1745 47 0.3%
 
1628 46 0.3%
 
1728 46 0.3%
 
1655 42 0.2%
 
1740 42 0.2%
 
1667 41 0.2%
 
1649 41 0.2%
 
1691 41 0.2%
 
1599 40 0.2%
 
1694 40 0.2%
 
Other values (1412) 17781 97.7%
 

Minimum 5 values

Value Count Frequency (%)  
731 1 0.0%
 
738 1 0.0%
 
752 2 0.0%
 
755 1 0.0%
 
757 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2278 1 0.0%
 
2279 2 0.0%
 
2280 1 0.0%
 
2281 1 0.0%
 
2346 1 0.0%
 

SprintSpeed
Highly correlated

This variable is highly correlated with Acceleration and should be ignored for analysis

Correlation 0.92193

Stamina
Numeric

Distinct count 86
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 63.22
Minimum 12
Maximum 96
Zeros (%) 0.0%

Quantile statistics

Minimum 12
5-th percentile 29
Q1 56
Median 66
Q3 74
95-th percentile 84
Maximum 96
Range 84
Interquartile range 18

Descriptive statistics

Standard deviation 15.895
Coef of variation 0.25142
Kurtosis 0.48157
Mean 63.22
MAD 12.195
Skewness -0.9154
Sum 1148000
Variance 252.64
Memory size 142.3 KiB
Value Count Frequency (%)  
68.0 694 3.8%
 
69.0 668 3.7%
 
67.0 630 3.5%
 
72.0 611 3.4%
 
65.0 587 3.2%
 
74.0 577 3.2%
 
66.0 574 3.2%
 
70.0 572 3.1%
 
75.0 556 3.1%
 
64.0 549 3.0%
 
Other values (75) 12141 66.7%
 

Minimum 5 values

Value Count Frequency (%)  
12.0 1 0.0%
 
13.0 2 0.0%
 
14.0 9 0.0%
 
15.0 4 0.0%
 
16.0 22 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 58 0.3%
 
93.0 27 0.1%
 
94.0 7 0.0%
 
95.0 2 0.0%
 
96.0 3 0.0%
 

StandingTackle
Highly correlated

This variable is highly correlated with Marking and should be ignored for analysis

Correlation 0.90654

Strength
Numeric

Distinct count 75
Unique (%) 0.4%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 65.312
Minimum 17
Maximum 97
Zeros (%) 0.0%

Quantile statistics

Minimum 17
5-th percentile 41
Q1 58
Median 67
Q3 74
95-th percentile 84
Maximum 97
Range 80
Interquartile range 16

Descriptive statistics

Standard deviation 12.557
Coef of variation 0.19226
Kurtosis 0.057389
Mean 65.312
MAD 9.922
Skewness -0.46836
Sum 1186000
Variance 157.68
Memory size 142.3 KiB
Value Count Frequency (%)  
68.0 678 3.7%
 
67.0 626 3.4%
 
65.0 623 3.4%
 
69.0 605 3.3%
 
70.0 596 3.3%
 
73.0 569 3.1%
 
72.0 569 3.1%
 
64.0 564 3.1%
 
66.0 561 3.1%
 
75.0 541 3.0%
 
Other values (64) 12227 67.2%
 

Minimum 5 values

Value Count Frequency (%)  
17.0 1 0.0%
 
24.0 2 0.0%
 
25.0 2 0.0%
 
26.0 4 0.0%
 
27.0 4 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
92.0 58 0.3%
 
93.0 29 0.2%
 
94.0 14 0.1%
 
95.0 3 0.0%
 
97.0 1 0.0%
 

Value €
Numeric

Distinct count 217
Unique (%) 1.2%
Missing (%) 1.4%
Missing (n) 252
Infinite (%) 0.0%
Infinite (n) 0
Mean 2444500
Minimum 10000
Maximum 118500000
Zeros (%) 0.0%

Quantile statistics

Minimum 10000
5-th percentile 100000
Q1 325000
Median 700000
Q3 2100000
95-th percentile 10000000
Maximum 118500000
Range 118490000
Interquartile range 1775000

Descriptive statistics

Standard deviation 5626700
Coef of variation 2.3018
Kurtosis 76.003
Mean 2444500
MAD 2783700
Skewness 7.0316
Sum 43892000000
Variance 31660000000000
Memory size 142.3 KiB
Value Count Frequency (%)  
1100000.0 431 2.4%
 
375000.0 372 2.0%
 
425000.0 354 1.9%
 
325000.0 351 1.9%
 
450000.0 343 1.9%
 
525000.0 338 1.9%
 
350000.0 325 1.8%
 
1200000.0 324 1.8%
 
400000.0 323 1.8%
 
1000000.0 318 1.7%
 
Other values (206) 14476 79.5%
 

Minimum 5 values

Value Count Frequency (%)  
10000.0 15 0.1%
 
20000.0 21 0.1%
 
30000.0 23 0.1%
 
40000.0 65 0.4%
 
50000.0 127 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
89000000.0 1 0.0%
 
93000000.0 1 0.0%
 
102000000.0 1 0.0%
 
110500000.0 1 0.0%
 
118500000.0 1 0.0%
 

Vision
Numeric

Distinct count 86
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 53.401
Minimum 10
Maximum 94
Zeros (%) 0.0%

Quantile statistics

Minimum 10
5-th percentile 29
Q1 44
Median 55
Q3 64
95-th percentile 74
Maximum 94
Range 84
Interquartile range 20

Descriptive statistics

Standard deviation 14.147
Coef of variation 0.26492
Kurtosis -0.35231
Mean 53.401
MAD 11.525
Skewness -0.35075
Sum 969710
Variance 200.13
Memory size 142.3 KiB
Value Count Frequency (%)  
58.0 595 3.3%
 
55.0 565 3.1%
 
60.0 541 3.0%
 
59.0 536 2.9%
 
62.0 513 2.8%
 
65.0 507 2.8%
 
64.0 498 2.7%
 
53.0 472 2.6%
 
63.0 470 2.6%
 
52.0 469 2.6%
 
Other values (75) 12993 71.4%
 

Minimum 5 values

Value Count Frequency (%)  
10.0 6 0.0%
 
11.0 20 0.1%
 
12.0 20 0.1%
 
13.0 17 0.1%
 
14.0 24 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
90.0 1 0.0%
 
91.0 3 0.0%
 
92.0 2 0.0%
 
93.0 1 0.0%
 
94.0 2 0.0%
 

Volleys
Numeric

Distinct count 88
Unique (%) 0.5%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 42.909
Minimum 4
Maximum 90
Zeros (%) 0.0%

Quantile statistics

Minimum 4
5-th percentile 12
Q1 30
Median 44
Q3 57
95-th percentile 70
Maximum 90
Range 86
Interquartile range 27

Descriptive statistics

Standard deviation 17.694
Coef of variation 0.41237
Kurtosis -0.7162
Mean 42.909
MAD 14.697
Skewness -0.14865
Sum 779180
Variance 313.09
Memory size 142.3 KiB
Value Count Frequency (%)  
49.0 402 2.2%
 
59.0 387 2.1%
 
48.0 387 2.1%
 
58.0 382 2.1%
 
45.0 374 2.1%
 
42.0 369 2.0%
 
43.0 368 2.0%
 
55.0 359 2.0%
 
52.0 357 2.0%
 
54.0 354 1.9%
 
Other values (77) 14420 79.2%
 

Minimum 5 values

Value Count Frequency (%)  
4.0 7 0.0%
 
5.0 90 0.5%
 
6.0 101 0.6%
 
7.0 110 0.6%
 
8.0 137 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
86.0 7 0.0%
 
87.0 5 0.0%
 
88.0 3 0.0%
 
89.0 1 0.0%
 
90.0 5 0.0%
 

Wage €
Numeric

Distinct count 144
Unique (%) 0.8%
Missing (%) 1.3%
Missing (n) 241
Infinite (%) 0.0%
Infinite (n) 0
Mean 9861.9
Minimum 1000
Maximum 565000
Zeros (%) 0.0%

Quantile statistics

Minimum 1000
5-th percentile 1000
Q1 1000
Median 3000
Q3 9000
95-th percentile 40000
Maximum 565000
Range 564000
Interquartile range 8000

Descriptive statistics

Standard deviation 22117
Coef of variation 2.2427
Kurtosis 99.693
Mean 9861.9
MAD 10584
Skewness 7.868
Sum 177180000
Variance 489170000
Memory size 142.3 KiB
Value Count Frequency (%)  
1000.0 4900 26.9%
 
2000.0 2827 15.5%
 
3000.0 1857 10.2%
 
4000.0 1255 6.9%
 
5000.0 869 4.8%
 
6000.0 682 3.7%
 
7000.0 488 2.7%
 
8000.0 423 2.3%
 
9000.0 328 1.8%
 
10000.0 319 1.8%
 
Other values (133) 4018 22.1%
 

Minimum 5 values

Value Count Frequency (%)  
1000.0 4900 26.9%
 
2000.0 2827 15.5%
 
3000.0 1857 10.2%
 
4000.0 1255 6.9%
 
5000.0 869 4.8%
 

Maximum 5 values

Value Count Frequency (%)  
380000.0 1 0.0%
 
405000.0 1 0.0%
 
420000.0 1 0.0%
 
455000.0 1 0.0%
 
565000.0 1 0.0%
 

Weak Foot
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.9473
Minimum 1
Maximum 5
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 2
Q1 3
Median 3
Q3 3
95-th percentile 4
Maximum 5
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.66046
Coef of variation 0.22409
Kurtosis 0.6579
Mean 2.9473
MAD 0.42629
Skewness 0.1384
Sum 53520
Variance 0.4362
Memory size 142.3 KiB
Value Count Frequency (%)  
3.0 11349 62.3%
 
2.0 3761 20.7%
 
4.0 2662 14.6%
 
5.0 229 1.3%
 
1.0 158 0.9%
 
(Missing) 48 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 158 0.9%
 
2.0 3761 20.7%
 
3.0 11349 62.3%
 
4.0 2662 14.6%
 
5.0 229 1.3%
 

Maximum 5 values

Value Count Frequency (%)  
1.0 158 0.9%
 
2.0 3761 20.7%
 
3.0 11349 62.3%
 
4.0 2662 14.6%
 
5.0 229 1.3%
 

WeightLbs
Numeric

Distinct count 58
Unique (%) 0.3%
Missing (%) 0.3%
Missing (n) 48
Infinite (%) 0.0%
Infinite (n) 0
Mean 165.98
Minimum 110
Maximum 243
Zeros (%) 0.0%

Quantile statistics

Minimum 110
5-th percentile 141
Q1 154
Median 165
Q3 176
95-th percentile 194
Maximum 243
Range 133
Interquartile range 22

Descriptive statistics

Standard deviation 15.593
Coef of variation 0.093948
Kurtosis 0.091381
Mean 165.98
MAD 12.422
Skewness 0.21659
Sum 3014000
Variance 243.15
Memory size 142.3 KiB
Value Count Frequency (%)  
165.0 1483 8.1%
 
154.0 1439 7.9%
 
176.0 1041 5.7%
 
172.0 987 5.4%
 
159.0 946 5.2%
 
161.0 936 5.1%
 
163.0 901 4.9%
 
170.0 860 4.7%
 
168.0 836 4.6%
 
174.0 705 3.9%
 
Other values (47) 8025 44.1%
 

Minimum 5 values

Value Count Frequency (%)  
110.0 2 0.0%
 
115.0 1 0.0%
 
117.0 6 0.0%
 
119.0 4 0.0%
 
121.0 10 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
227.0 2 0.0%
 
229.0 1 0.0%
 
234.0 1 0.0%
 
236.0 2 0.0%
 
243.0 1 0.0%
 

Work Rate
Categorical

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.3%
Missing (n) 48
Medium/ Medium
9810
High/ Medium
3173
Medium/ High
 
1690
Other values (6)
3486
Value Count Frequency (%)  
Medium/ Medium 9810 53.9%
 
High/ Medium 3173 17.4%
 
Medium/ High 1690 9.3%
 
High/ High 1015 5.6%
 
Medium/ Low 850 4.7%
 
High/ Low 699 3.8%
 
Low/ Medium 449 2.5%
 
Low/ High 439 2.4%
 
Low/ Low 34 0.2%
 
(Missing) 48 0.3%
 

Correlations

Sample

Order ID Name Age Photo Nationality Flag Overall Potential Club Club Logo Value € Wage € Special Preferred Foot International Reputation Weak Foot Skill Moves Work Rate Body Type Real Face Position Jersey Number Joined Loaned_From Contract Valid Until HeightMts WeightLbs LS ST RS LW LF CF RF RW LAM CAM RAM LM LCM CM RCM RM LWB LDM CDM RDM RWB LB LCB CB RCB RB Crossing Finishing HeadingAccuracy ShortPassing Volleys Dribbling Curve FKAccuracy LongPassing BallControl Acceleration SprintSpeed Agility Reactions Balance ShotPower Jumping Stamina Strength LongShots Aggression Interceptions Positioning Vision Penalties Composure Marking StandingTackle SlidingTackle GKDiving GKHandling GKKicking GKPositioning GKReflexes Release Clause €
0 0 158023 L. Messi 31 https://cdn.sofifa.org/players/4/19/158023.png Argentina https://cdn.sofifa.org/flags/52.png 94 94 FC Barcelona https://cdn.sofifa.org/teams/2/light/241.png 110500000.0 565000.0 2202 Left 5.0 4.0 4.0 Medium/ Medium Messi Yes RF 10.0 1-Jul-04 NaN 2021 1.70 159.0 88 88 88 92 93 93 93 92 93 93 93 91 84 84 84 91 64 61 61 61 64 59 47 47 47 59 84.0 95.0 70.0 90.0 86.0 97.0 93.0 94.0 87.0 96.0 91.0 86.0 91.0 95.0 95.0 85.0 68.0 72.0 59.0 94.0 48.0 22.0 94.0 94.0 75.0 96.0 33.0 28.0 26.0 6.0 11.0 15.0 14.0 8.0 226500000.0
1 1 20801 Cristiano Ronaldo 33 https://cdn.sofifa.org/players/4/19/20801.png Portugal https://cdn.sofifa.org/flags/38.png 94 94 Juventus https://cdn.sofifa.org/teams/2/light/45.png 77000000.0 405000.0 2228 Right 5.0 4.0 5.0 High/ Low C. Ronaldo Yes ST 7.0 10-Jul-18 NaN 2022 1.88 183.0 91 91 91 89 90 90 90 89 88 88 88 88 81 81 81 88 65 61 61 61 65 61 53 53 53 61 84.0 94.0 89.0 81.0 87.0 88.0 81.0 76.0 77.0 94.0 89.0 91.0 87.0 96.0 70.0 95.0 95.0 88.0 79.0 93.0 63.0 29.0 95.0 82.0 85.0 95.0 28.0 31.0 23.0 7.0 11.0 15.0 14.0 11.0 127100000.0
2 2 190871 Neymar Jr 26 https://cdn.sofifa.org/players/4/19/190871.png Brazil https://cdn.sofifa.org/flags/54.png 92 93 Paris Saint-Germain https://cdn.sofifa.org/teams/2/light/73.png 118500000.0 290000.0 2143 Right 5.0 5.0 5.0 High/ Medium Neymar Yes LW 10.0 3-Aug-17 NaN 2022 1.75 150.0 84 84 84 89 89 89 89 89 89 89 89 88 81 81 81 88 65 60 60 60 65 60 47 47 47 60 79.0 87.0 62.0 84.0 84.0 96.0 88.0 87.0 78.0 95.0 94.0 90.0 96.0 94.0 84.0 80.0 61.0 81.0 49.0 82.0 56.0 36.0 89.0 87.0 81.0 94.0 27.0 24.0 33.0 9.0 9.0 15.0 15.0 11.0 228100000.0
3 3 193080 De Gea 27 https://cdn.sofifa.org/players/4/19/193080.png Spain https://cdn.sofifa.org/flags/45.png 91 93 Manchester United https://cdn.sofifa.org/teams/2/light/11.png 72000000.0 260000.0 1471 Right 4.0 3.0 1.0 Medium/ Medium Lean Yes GK 1.0 1-Jul-11 NaN 2020 1.93 168.0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 17.0 13.0 21.0 50.0 13.0 18.0 21.0 19.0 51.0 42.0 57.0 58.0 60.0 90.0 43.0 31.0 67.0 43.0 64.0 12.0 38.0 30.0 12.0 68.0 40.0 68.0 15.0 21.0 13.0 90.0 85.0 87.0 88.0 94.0 138600000.0
4 4 192985 K. De Bruyne 27 https://cdn.sofifa.org/players/4/19/192985.png Belgium https://cdn.sofifa.org/flags/7.png 91 92 Manchester City https://cdn.sofifa.org/teams/2/light/10.png 102000000.0 355000.0 2281 Right 4.0 5.0 4.0 High/ High Normal Yes RCM 7.0 30-Aug-15 NaN 2023 1.80 154.0 82 82 82 87 87 87 87 87 88 88 88 88 87 87 87 88 77 77 77 77 77 73 66 66 66 73 93.0 82.0 55.0 92.0 82.0 86.0 85.0 83.0 91.0 91.0 78.0 76.0 79.0 91.0 77.0 91.0 63.0 90.0 75.0 91.0 76.0 61.0 87.0 94.0 79.0 88.0 68.0 58.0 51.0 15.0 13.0 5.0 10.0 13.0 196400000.0

Insights

There is a large number of correlated variables in the original dataset, which is an ideal scenario to run a PCA and thus reduce the dimensions while maintaining the variance and the direction of the data.