Notifications
Clear all

Data Integrity?

5 Posts
3 Users
0 Reactions
637 Views
(@ttombobadly)
Posts: 274
Reputable Member
Topic starter
 

Blade or Cash - quick question - I was reviewing the detailed matchup on Red Sox and Mariners and it lists Porcello's 6/18/2016 start against the Mariners as a 6-2 win, in which he gave up 4 ER. I validated the score via google that it was a 6-2 win, but obviously Porcello couldn't have given up four runs (he gave up 2). I feel like I may have spotted such inconsistencies before never really followed up - can you guys weigh in? Thanks

 
Posted : August 3, 2016 1:46 pm
(@michael-cash)
Posts: 7610
Member Moderator
 

I have asked the provider of the raw data to investigate. It looks like it's correct on Porcello's pitcher page but incorrect within the matchup but I do not know what would cause that.

Just an FYI TheSpread purchases raw data and parses it for our own use just like pretty much all other sites do except ESPN and the league itself. So if there is an error in the data it's an error that comes down in the raw data, not anything to do with the way we manipulate it.

If you ever notice anything feel free to send it in using the contact us link at the top of the page. Sometimes I don't stop by the forum for days at a time so if there is an issue, that is the quickest way to get it to someone who can look into it.

If I can find out what happened in this instance I will be sure to share it.

 
Posted : August 3, 2016 5:02 pm
(@blade)
Posts: 318493
Illustrious Member
 

I work with tons of data every day at work for horse racing and you wouldn't believe the amount of data we pay for that is wrong, unless we or a customer catches it it goes unnoticed. Like Cash said your data is only as good as what's provided to you unfortunately.

Currently I'm trying to compile data for French Harness racing which to say has been a nightmare would be the under statement of the century lol

 
Posted : August 3, 2016 11:52 pm
(@ttombobadly)
Posts: 274
Reputable Member
Topic starter
 

Thanks for the info! I did a couple years of data analysis for a large CPG company that sold in every major retailer and grocery chain you can think of.. I had this broken down to specific stores by day product, etc.. it can definitely be a nightmare and even more frustrating, especially to upper management who's bene paying for said crap data. You have no idea how bad I've wanted to pay for one of those databases so I could do my own slice and dice on everything. If anyone has an old excel file they want to share, let me know 😉

 
Posted : August 4, 2016 3:53 pm
(@michael-cash)
Posts: 7610
Member Moderator
 

I am sure a bunch of people scrape our stuff and save it but, I don't know how successful you will be getting someone to fess up and probably even less successful getting someone to actually agree to share it.

I have a ton of confidence in our data provider but at the end of the day mistakes do happen. All of this stuff no matter where it comes from starts with humans. A computer can't watch a game and calculate balls and strikes or first downs or what have you.

We have been notified of issues from time to time and also found stuff on our own but this particular issue is a mystery because the pitcher report has the info correct but the matchup did not. I have never seen that before. Generally when stuff is messed up its messed up on any page that info is present.

I'm still waiting for a reply from the provider but it might be one of those things where the explanation they offer doesn't satisfy you or us. But I'll stay after it nonetheless.

 
Posted : August 4, 2016 4:36 pm
Share: