FotMob
- class soccerdata.FotMob(leagues=None, seasons=None, proxy=None, no_cache=False, no_store=False, data_dir=PosixPath('/home/docs/soccerdata/data/FotMob'))
Provides pd.DataFrames from data available at http://www.fotmob.com.
Data will be downloaded as necessary and cached locally in ~/soccerdata/data/FotMob.
- Parameters:
leagues (string or iterable, optional) – IDs of Leagues to include.
seasons (string, int or list, optional) – Seasons to include. Supports multiple formats. Examples: ‘16-17’; 2016; ‘2016-17’; [14, 15, 16]
proxy ('tor' or dict or list(dict) or callable, optional) – Use a proxy to hide your IP address. Valid options are:
'tor': Uses the Tor network. Tor should be running in the background on port 9050.
dict: A dictionary with the proxy to use. The dict should be a mapping of supported protocols to proxy addresses. For example:
{ 'http': 'http://10.10.1.10:3128', 'https': 'http://10.10.1.10:1080', }
list(dict): A list of proxies to choose from. A different proxy will be selected from this list after failed requests, allowing rotating proxies.
callable: A function that returns a valid proxy. This function will be called after failed requests, allowing rotating proxies.
no_cache (bool) – If True, will not use cached data.
no_store (bool) – If True, will not store downloaded data.
data_dir (Path) – Path to directory where data will be cached.
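The callable form of proxy can be illustrated with a small rotating pool. This is a minimal sketch, not the library's own mechanism: next_proxy and PROXIES are hypothetical names, the second proxy address is invented for illustration, and it assumes the callable should return the same protocol-to-address mapping described for the dict option. The league ID in the commented usage is also an assumption; check available_leagues() for the IDs actually supported.

```python
import itertools

# Hypothetical proxy pool. The first entry reuses the addresses from the
# dict example above; the second entry is made up for illustration.
PROXIES = [
    {"http": "http://10.10.1.10:3128", "https": "http://10.10.1.10:1080"},
    {"http": "http://10.10.1.11:3128", "https": "http://10.10.1.11:1080"},
]

# Endless iterator that wraps around when the pool is exhausted.
_pool = itertools.cycle(PROXIES)


def next_proxy():
    """Return the next proxy mapping from the pool."""
    return next(_pool)


# Usage (needs soccerdata installed and network access, so left commented out):
# import soccerdata as sd
# fotmob = sd.FotMob(leagues="ENG-Premier League", seasons="2016-17", proxy=next_proxy)
```

Because the function is documented as being called after failed requests, each failure would move on to the next entry in the pool.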
- read_leagues()
Retrieve the selected leagues from the datasource.
- Return type:
pd.DataFrame
- read_seasons()
Retrieve the selected seasons for the selected leagues.
- Return type:
pd.DataFrame
- read_league_table(force_cache=False)
Retrieve the league table for the selected leagues.
- Parameters:
force_cache (bool) – By default no cached data is used for the current season. If True, will force the use of cached data anyway.
- Return type:
pd.DataFrame
- read_schedule(force_cache=False)
Retrieve the game schedule for the selected leagues and seasons.
- Parameters:
force_cache (bool) – By default no cached data is used for the current season. If True, will force the use of cached data anyway.
- Return type:
pd.DataFrame
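As a rough usage sketch, the schedule and league table can be fetched together while passing the same force_cache setting to both calls. The helper name load_schedule_and_table is hypothetical, and the import is done lazily so that defining the helper does not require the package; actually calling it needs soccerdata installed and, unless everything is already cached, network access.

```python
def load_schedule_and_table(leagues, seasons, force_cache=False):
    """Return (schedule, league_table) DataFrames for the given selection.

    Hypothetical convenience wrapper around the documented methods.
    """
    # Imported here so the helper can be defined without soccerdata installed.
    import soccerdata as sd

    fotmob = sd.FotMob(leagues=leagues, seasons=seasons)
    # force_cache=True reuses cached data even for the current season.
    schedule = fotmob.read_schedule(force_cache=force_cache)
    table = fotmob.read_league_table(force_cache=force_cache)
    return schedule, table
```

Leaving force_cache at its default of False keeps the documented behaviour of not using cached data for the current season.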
- read_team_match_stats(stat_type='Top stats', opponent_stats=True, team=None, force_cache=False)
Retrieve the match stats for the selected leagues and seasons.
- The following stat types are available:
‘Top stats’
‘Shots’
‘Expected goals (xG)’
‘Passes’
‘Defence’
‘Duels’
‘Discipline’
- Parameters:
stat_type (str) – Type of stats to retrieve.
opponent_stats (bool) – If True, will retrieve opponent stats.
team (str or list of str, optional) – Team(s) to retrieve. If None, will retrieve all teams.
force_cache (bool) – By default no cached data is used to scrape the list of available games for the current season. If True, will force the use of cached data anyway.
- Raises:
TypeError – If stat_type is not valid.
ValueError – If no games with the given IDs were found for the selected seasons and leagues.
- Return type:
pd.DataFrame
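Since an invalid stat_type raises TypeError, it can be convenient to check the value locally before starting a long scrape. The sketch below is a hypothetical pre-check built only from the stat types listed above; it is not the library's own validation, and the names VALID_STAT_TYPES and check_stat_type are invented for illustration. It assumes the match is exact and case-sensitive.

```python
# Stat types copied from the list documented for read_team_match_stats.
VALID_STAT_TYPES = (
    "Top stats",
    "Shots",
    "Expected goals (xG)",
    "Passes",
    "Defence",
    "Duels",
    "Discipline",
)


def check_stat_type(stat_type):
    """Return stat_type unchanged, or raise TypeError if it is not listed."""
    if stat_type not in VALID_STAT_TYPES:
        raise TypeError(
            f"Invalid stat_type {stat_type!r}; expected one of {VALID_STAT_TYPES}"
        )
    return stat_type


# Usage (needs soccerdata and a configured FotMob instance, so commented out):
# stats = fotmob.read_team_match_stats(stat_type=check_stat_type("Shots"))
```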
- classmethod available_leagues()
Return a list of league IDs available for this source.
- Return type:
List[str]