• Welcome to the Cricket Web forums, one of the biggest forums in the world dedicated to cricket.

    You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join the Cricket Web community today!

    If you have any problems with the registration process or your account login, please contact us.

One for the Statisticians - Batters Feast of Famine

lug

Cricket Spectator
Hi - first post here - for a while anyway...

Is this the best place / forum to ask a (very) statistical question? I'm trying to derive a robust means of measuring a batter's decline based on their potential for less consistency after a given event in their career? I don't want to join ACS (Association of Cricket Statisticians) as I'm not a professional Cricket statistician so is there a better free forum or can I ask the question here?

Thanks
 

chris.hinton

International Captain
Hi - first post here - for a while anyway...

Is this the best place / forum to ask a (very) statistical question? I'm trying to derive a robust means of measuring a batter's decline based on their potential for less consistency after a given event in their career? I don't want to join ACS (Association of Cricket Statisticians) as I'm not a professional Cricket statistician so is there a better free forum or can I ask the question here?

Thanks
Just ask here. I'm sure we will do our very best to answer the question
 

lug

Cricket Spectator
I can think of several batters over the years who played on past their best. In their latter years, it seemed to me that they made more very low scores and "bumped up" their averages (and therefore perhaps unjustifiably kept their places) by scoring massively against weaker opponents or on very flat pitches. The question is... How to demonstrate this.

Two problems:
1) Identify the point at which their consistency ended.
2) calculate a robust measure that will show this.


For the measure, I was thinking of the number of inns pre and post watershed that were +- 1sd of their career ave. I was thinking of trying this on those batters with (arbitrarily a 10 year test career and 6000 runs) but don't know how / where to get an inns by inns list for that many players. You can do it player by player on statsguru but I need a dataset of probably 100 batters. Cricsheet can supply it but only for the last 20 years or so but I want to go back further.

As for identifying the watershed, this presumably means subdividing the population into at least 2 samples and calculating the metric above but need to know which formula to use to select the subsamples. One obvious constraint being each subsample must contain consecutive observations i.e. inns.

A few thoughts there! Discuss! Is it statistically sound? Where to get data? Software?

Cheers
 

Coronis

Hall of Fame Member
I can think of several batters over the years who played on past their best. In their latter years, it seemed to me that they made more very low scores and "bumped up" their averages (and therefore perhaps unjustifiably kept their places) by scoring massively against weaker opponents or on very flat pitches. The question is... How to demonstrate this.

Two problems:
1) Identify the point at which their consistency ended.
2) calculate a robust measure that will show this.


For the measure, I was thinking of the number of inns pre and post watershed that were +- 1sd of their career ave. I was thinking of trying this on those batters with (arbitrarily a 10 year test career and 6000 runs) but don't know how / where to get an inns by inns list for that many players. You can do it player by player on statsguru but I need a dataset of probably 100 batters. Cricsheet can supply it but only for the last 20 years or so but I want to go back further.

As for identifying the watershed, this presumably means subdividing the population into at least 2 samples and calculating the metric above but need to know which formula to use to select the subsamples. One obvious constraint being each subsample must contain consecutive observations i.e. inns.

A few thoughts there! Discuss! Is it statistically sound? Where to get data? Software?

Cheers
fwiw, 75 batsmen have made 6000 runs, and only one played less than 10 years (Mike Hussey) About half of those played purely in the last 20 years.
 

lug

Cricket Spectator
Does anyone know if there's a way to produce an innings by innings list for a given subset of batters? I appreciate this might be a big dataset.
 

Thala_0710

International Regular
Does anyone know if there's a way to produce an innings by innings list for a given subset of batters? I appreciate this might be a big dataset.
You can go to statsguru and there's an option to filter the players u want, and an innings by innings option as well
 

lug

Cricket Spectator
I've tried Statsgugu but I can't find a way of doing a list for multiple players - only one player at a time. Do you know a way?
 

lug

Cricket Spectator
Thanks but that's a list of all innings played in England in the last 5 years and runs to 29 pages. What I want is every innings played by all players with over 5,000 test runs in the last 50 years. Is there a way to do this in one go?
 

Thala_0710

International Regular
Thanks but that's a list of all innings played in England in the last 5 years and runs to 29 pages. What I want is every innings played by all players with over 5,000 test runs in the last 50 years. Is there a way to do this in one go?
Yup, but this will require certain amount of coding
 
Last edited:

Thala_0710

International Regular
Yup, but this will require certain amount of coding

Go to this URL, there's a white arrow at the absolute right in a blue box, click on it and select batting innings for player.
Manually copy and pasting for 89 players will be tough, but chatgpt or any other ai should be able to right the required code quite easily.
 

lug

Cricket Spectator
OK - the pasting for 89 players is what I was trying to avoid.

Out of interest, is there a database anywhere I can access to do this or is everything via the front end?
 

capt_Luffy

Hall of Fame Member
OK - the pasting for 89 players is what I was trying to avoid.

Out of interest, is there a database anywhere I can access to do this or is everything via the front end?
I think the statsguru query Thala provided is a pretty clear cut database of what you are looking for. Don't think a simpler version is possible really. A little bit of coding and getting the output in a CSV should be simple imo.
 

Top