Data Mining beyond Adventure Works @ Atlanta SQL Saturday
Mark Tabladillo, Ph.D.
April 25, 2009
Demonstration One: Baseball Management
Directions: You are on the management team for the Atlanta Braves. To better serve the team, you have been
instructed by the owner to group the players by considering both their position and their salary. The following rules
apply:
1) Players at different positions may be in the same group
2) You must make more than one group
3) Each group must have at least two players
Team            Name                  Salary  Position        Group
Atlanta Braves  Bernero, Adam        450,000  Pitcher
Atlanta Braves  Betemit, Wilson      316,000  Shortstop
Atlanta Braves  Colon, Roman         318,500  Pitcher
Atlanta Braves  Estrada, Johnny      460,000  Catcher
Atlanta Braves  Franco, Julio      1,000,000  First Baseman
Atlanta Braves  Furcal, Rafael     5,600,000  Shortstop
Atlanta Braves  Giles, Marcus      2,350,000  Second Baseman
Atlanta Braves  Gryboski, Kevin      877,500  Pitcher
Atlanta Braves  Hampton, Mike     15,125,000  Pitcher
Atlanta Braves  Hudson, Tim        6,500,000  Pitcher
Atlanta Braves  Jones, Andruw     13,000,000  Outfielder
Atlanta Braves  Jones, Chipper    16,061,802  Outfielder
Atlanta Braves  Jordan, Brian        600,000  Outfielder
Atlanta Braves  Kolb, Dan          3,400,000  Pitcher
Atlanta Braves  Langerhans, Ryan     316,000  Outfielder
Atlanta Braves  LaRoche, Adam        337,500  First Baseman
Atlanta Braves  Martin, Tom        1,900,000  Pitcher
Atlanta Braves  Mondesi, Raul      1,000,000  Outfielder
Atlanta Braves  Orr, Pete            300,000  Second Baseman
Atlanta Braves  Perez, Eddie         625,000  Catcher
Atlanta Braves  Ramirez, Horacio     370,000  Pitcher
Atlanta Braves  Reitsma, Chris     1,650,000  Pitcher
Atlanta Braves  Smoltz, John       9,000,000  Pitcher
Atlanta Braves  Sosa, Jorge          650,000  Pitcher
Atlanta Braves  Thomson, John      4,250,000  Pitcher
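
One possible way to approach the grouping exercise in code is sketched below. It is only an illustration, not the method used in the session: it ignores position entirely (rule 1 allows mixed positions), sorts the players by salary, and cuts the list in two at the widest gap in log-salary, while keeping at least two players on each side (rules 2 and 3). The function name split_at_largest_gap and the min_size parameter are assumptions introduced for this sketch.

```python
import math

# (name, salary in dollars, position), copied from the handout table.
PLAYERS = [
    ("Bernero, Adam", 450_000, "Pitcher"),
    ("Betemit, Wilson", 316_000, "Shortstop"),
    ("Colon, Roman", 318_500, "Pitcher"),
    ("Estrada, Johnny", 460_000, "Catcher"),
    ("Franco, Julio", 1_000_000, "First Baseman"),
    ("Furcal, Rafael", 5_600_000, "Shortstop"),
    ("Giles, Marcus", 2_350_000, "Second Baseman"),
    ("Gryboski, Kevin", 877_500, "Pitcher"),
    ("Hampton, Mike", 15_125_000, "Pitcher"),
    ("Hudson, Tim", 6_500_000, "Pitcher"),
    ("Jones, Andruw", 13_000_000, "Outfielder"),
    ("Jones, Chipper", 16_061_802, "Outfielder"),
    ("Jordan, Brian", 600_000, "Outfielder"),
    ("Kolb, Dan", 3_400_000, "Pitcher"),
    ("Langerhans, Ryan", 316_000, "Outfielder"),
    ("LaRoche, Adam", 337_500, "First Baseman"),
    ("Martin, Tom", 1_900_000, "Pitcher"),
    ("Mondesi, Raul", 1_000_000, "Outfielder"),
    ("Orr, Pete", 300_000, "Second Baseman"),
    ("Perez, Eddie", 625_000, "Catcher"),
    ("Ramirez, Horacio", 370_000, "Pitcher"),
    ("Reitsma, Chris", 1_650_000, "Pitcher"),
    ("Smoltz, John", 9_000_000, "Pitcher"),
    ("Sosa, Jorge", 650_000, "Pitcher"),
    ("Thomson, John", 4_250_000, "Pitcher"),
]


def split_at_largest_gap(players, min_size=2):
    """Split players into two salary groups at the widest log-salary gap.

    Hypothetical helper: position is deliberately ignored here.
    """
    ranked = sorted(players, key=lambda p: p[1])
    best_cut, best_gap = None, -1.0
    # A cut at index i puts ranked[:i] in the low group and ranked[i:] in
    # the high group; the range keeps at least min_size players per group.
    for i in range(min_size, len(ranked) - min_size + 1):
        gap = math.log(ranked[i][1]) - math.log(ranked[i - 1][1])
        if gap > best_gap:
            best_cut, best_gap = i, gap
    return ranked[:best_cut], ranked[best_cut:]


low, high = split_at_largest_gap(PLAYERS)
print(len(low), "lower-paid players, top salary", max(s for _, s, _ in low))
print(len(high), "higher-paid players, bottom salary", min(s for _, s, _ in high))
```

Working on the log of salary rather than raw dollars is a design choice of this sketch: it compares salaries by ratio, so the step from 1,000,000 to 1,650,000 counts for more than the step from 15,125,000 to 16,061,802. A different scale, or bringing position into the distance measure, could reasonably produce different groups.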
Demonstration Two: Government Statistics
Directions:
The President is asking your opinion on how the following numbers will increase over the next few months. Because this
project is sensitive, you do not know what these numbers measure. However, based on the available history, make your
best projection for the next five periods.
Year Period Value
2006 Jan 4.7
2006 Feb 4.8
2006 Mar 4.7
2006 Apr 4.7
2006 May 4.7
2006 Jun 4.6
2006 Jul 4.7
2006 Aug 4.7
2006 Sep 4.5
2006 Oct 4.4
2006 Nov 4.5
2006 Dec 4.4
2007 Jan 4.6
2007 Feb 4.5
2007 Mar 4.4
2007 Apr 4.5
2007 May 4.5
2007 Jun 4.6
2007 Jul 4.7
2007 Aug 4.7
2007 Sep 4.7
2007 Oct 4.8
2007 Nov 4.7
2007 Dec 4.9
2008 Jan 4.9
2008 Feb 4.8
2008 Mar 5.1
2008 Apr 5.0
2008 May
2008 Jun
2008 Jul
2008 Aug
2008 Sep
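
As one illustration of how a projection for the five blank periods could be produced, the sketch below fits a straight line to the 28 listed values by ordinary least squares and extends it five periods forward. This is a deliberately naive baseline chosen for this example, not the technique used in the session, and the function name linear_trend_forecast is an assumption.

```python
# Monthly values copied from the handout table, Jan 2006 through Apr 2008.
VALUES = [
    4.7, 4.8, 4.7, 4.7, 4.7, 4.6, 4.7, 4.7, 4.5, 4.4, 4.5, 4.4,  # 2006
    4.6, 4.5, 4.4, 4.5, 4.5, 4.6, 4.7, 4.7, 4.7, 4.8, 4.7, 4.9,  # 2007
    4.9, 4.8, 5.1, 5.0,                                          # 2008
]


def linear_trend_forecast(values, horizon=5):
    """Fit y = intercept + slope * t by least squares and extrapolate.

    Hypothetical helper: t is the period index 0, 1, 2, ...
    """
    n = len(values)
    t_mean = (n - 1) / 2
    y_mean = sum(values) / n
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(values))
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    # Project the fitted line onto the next `horizon` period indexes.
    return [intercept + slope * t for t in range(n, n + horizon)]


forecast = linear_trend_forecast(VALUES)
for label, value in zip(["May", "Jun", "Jul", "Aug", "Sep"], forecast):
    print(f"2008 {label} {value:.2f}")
```

A straight line assumes the series keeps moving at its long-run average pace, so it reacts slowly to a recent change in direction. Fitting only the most recent months, or using a model with seasonal or autoregressive terms, would be reasonable alternatives to compare against.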
Demonstration Two: Government Statistics
Directions:
The President is asking your opinion on how the following numbers will increase over the next few months. Because this
project is sensitive, you do not know what these numbers measure. However, based on the available history, make your
best projection for the next five periods.
Year Period Value
2007 Jan 4.6
2007 Feb 4.5
2007 Mar 4.4
2007 Apr 4.5
2007 May 4.5
2007 Jun 4.6
2007 Jul 4.7
2007 Aug 4.7
2007 Sep 4.7
2007 Oct 4.8
2007 Nov 4.7
2007 Dec 4.9
2008 Jan 4.9
2008 Feb 4.8
2008 Mar 5.1
2008 Apr 5.0
2008 May 5.5
2008 Jun 5.6
2008 Jul 5.8
2008 Aug 6.2
2008 Sep 6.2
2008 Oct 6.6
2008 Nov 6.8
2008 Dec 7.2
2009 Jan 7.6
2009 Feb 8.1
2009 Mar 8.5
2009 Apr
2009 May
2009 Jun
2009 Jul
2009 Aug
Microsoft provides excellent tutorials and information about data mining through the fictional Adventure Works demos. However, what happens when you stray off that neat-and-tidy path? Data miners should be concerned about data preparation, proper algorithm selection, and correct interpretation. This interactive experience will consist of succinct audience participation demos to introduce some practical issues in real-world data mining.