<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
	<link rel="self" href="/Atom/" />
	<id>http://billmill.org/</id>
	<title>My Name Rhymes</title>
	<subtitle>Bill Mill blogs irregularly</subtitle>
	<updated>2005-04-05T23:13:00Z</updated>
	<author>
		<name>Bill Mill</name>
		<email>bill.mill@gmail.com</email>
		<uri>http://billmill.org/</uri>
	</author>
	<link href="http://billmill.org/" />
	<entry>
		<title>Python vs. Illinois</title>
		<link href="http://billmill.org/illinois.html" />	
		<id>http://billmill.org/illinois.html</id>
		<updated>2005-04-05T23:13:00Z</updated>
		<summary type="html">No, I'm not dead
yet, I've just been hibernating. And by "hibernating", I mean "watching
college basketball". As the geekier of you probably don't know, the Illinois
men's basketball team just ended a dream season tonight with a loss to North
Carolina in the NCAA tournament finals. Prior to tonight's loss, they only had
&lt;a href="http://www.msnbc.msn.com/id/7108724/"&gt;one loss&lt;/a&gt;, on a buzzerbeater,
to Ohio State.&lt;p&gt;
"What in the world", I can hear you asking, "does that have to do with Python?"
Well, on a &lt;a href="http://www.yocohoops.com"&gt;college basketball board&lt;/a&gt; I
frequent, a user named madurzak 
&lt;a href="http://www.yocohoops.com/story/2005/4/2/10954/76021"&gt;proved 
conclusively&lt;/a&gt; that the worst team, &lt;a
href="http://en.wikipedia.org/wiki/Ratings_Percentage_Index"&gt;RPI-wise&lt;/a&gt;, in
college basketball is better than Illinois:&lt;p&gt;
&lt;blockquote&gt;Maryland Eastern Shore beat&lt;br&gt;
MD Baltimore Country which beat&lt;br&gt;
Maine who beat&lt;br&gt;
Northeastern who had a nice win against&lt;br&gt;
UMass who somehow beat&lt;br&gt;
UConn who beat&lt;br&gt;
Indiana who beat&lt;br&gt;
Ohio St. who shocked&lt;br&gt;
Illinois&lt;/blockquote&gt;&lt;p&gt;
This got me to wondering what some other teams' Illinois Number (as I christened
it, after the famous &lt;a href="http://en.wikipedia.org/wiki/Erdos_number"&gt;Erdos
Number&lt;/a&gt;) would be. Armed with &lt;a href="http://kenpom.com"&gt;Ken Pomeroy's&lt;/a&gt;
(&lt;a href="#illinois_txt_foot1"&gt;[1]&lt;/a&gt;)
&lt;a href="http://kenpom.com/cbbga.txt"&gt;game data&lt;/a&gt;, Python, and a spare hour,
I set out to find out. Would every team have an Illinois Number? What would the
highest Illinois Number be?&lt;p&gt;
To figure out, I simply built a tree with Illinois at the top, Ohio State below,
the teams that beat Ohio State below that, and so on until all the games that
fit into the tree had been properly placed. I've posted the basic code to
construct the tree &lt;a 
href="http://billmill.org/static/files/gen_illinois_tree.py"&gt;here&lt;/a&gt;; it
requires the &lt;a href="http://kenpom.com/cbbga.txt"&gt;data&lt;/a&gt; I mentioned
earlier. After some massaging of the results, I threw up a quick CGI app to
explore them &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.html"&gt;here&lt;/a&gt;.&lt;p&gt;
&lt;h2&gt;Conclusion&lt;/h2&gt;&lt;p&gt;
There's no deep conclusion; I just thought it was a neat little script, so I
wanted to post it. There are a few interesting thoughts I had on the data,
though:
&lt;ul&gt;&lt;li&gt;The highest Illinois Number is 7, and it belongs to the 
little Division III squad of &lt;a
href="http://www.hsc.edu/athletics/basketball/"&gt;Hampden Sydney&lt;/a&gt;, an 
all-men's school in Virginia. They &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=hampden+sydney"&gt;beat&lt;/a&gt;
the &lt;a
href="http://www.longwoodlancers.com/index.asp?path=mbasket"&gt;Longwood
Lancers&lt;/a&gt;, who managed only one win all season.
&lt;li&gt;Both Division I one-win teams, &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=army"&gt;Army&lt;/a&gt;
and the aforementioned &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=longwood"&gt;Longwood&lt;/a&gt;,
had Illinois Numbers of 6
&lt;li&gt;As far as I can tell, the only Division I school without an Illinois Number
is lowly Savannah State, who went 0-28 this season.
&lt;li&gt;For this type of task - parsing text, building a relatively small tree, and
playing with it - Python is the best. I love playing with data in the
interpreter, coding very very high-level data structures in minutes, and trying
things out as fast as I can type them.&lt;p&gt;&lt;/ul&gt;
&lt;a name="illinois_txt_foot1"&gt;[1]: Let me mention that, if you're a stats geek,
you should have been reading &lt;a href="http://kenpom.com"&gt;Ken's site&lt;/a&gt; for a
while. Go there for his interesting custom college basketball ranking system,
stay for the game analysis on his blog.&lt;p&gt;
[2]: The data analyzed here does not include tonight's North Carolina game,
since I coded everything this afternoon. Tonight was the first time I was 
rooting for a sports team just to keep my blog consistent, but NC's defense was just too good for the Fightin' Illini.
</summary>
		<content type="html">No, I'm not dead
yet, I've just been hibernating. And by "hibernating", I mean "watching
college basketball". As the geekier of you probably don't know, the Illinois
men's basketball team just ended a dream season tonight with a loss to North
Carolina in the NCAA tournament finals. Prior to tonight's loss, they only had
&lt;a href="http://www.msnbc.msn.com/id/7108724/"&gt;one loss&lt;/a&gt;, on a buzzerbeater,
to Ohio State.&lt;p&gt;
"What in the world", I can hear you asking, "does that have to do with Python?"
Well, on a &lt;a href="http://www.yocohoops.com"&gt;college basketball board&lt;/a&gt; I
frequent, a user named madurzak 
&lt;a href="http://www.yocohoops.com/story/2005/4/2/10954/76021"&gt;proved 
conclusively&lt;/a&gt; that the worst team, &lt;a
href="http://en.wikipedia.org/wiki/Ratings_Percentage_Index"&gt;RPI-wise&lt;/a&gt;, in
college basketball is better than Illinois:&lt;p&gt;
&lt;blockquote&gt;Maryland Eastern Shore beat&lt;br&gt;
MD Baltimore Country which beat&lt;br&gt;
Maine who beat&lt;br&gt;
Northeastern who had a nice win against&lt;br&gt;
UMass who somehow beat&lt;br&gt;
UConn who beat&lt;br&gt;
Indiana who beat&lt;br&gt;
Ohio St. who shocked&lt;br&gt;
Illinois&lt;/blockquote&gt;&lt;p&gt;
This got me to wondering what some other teams' Illinois Number (as I christened
it, after the famous &lt;a href="http://en.wikipedia.org/wiki/Erdos_number"&gt;Erdos
Number&lt;/a&gt;) would be. Armed with &lt;a href="http://kenpom.com"&gt;Ken Pomeroy's&lt;/a&gt;
(&lt;a href="#illinois_txt_foot1"&gt;[1]&lt;/a&gt;)
&lt;a href="http://kenpom.com/cbbga.txt"&gt;game data&lt;/a&gt;, Python, and a spare hour,
I set out to find out. Would every team have an Illinois Number? What would the
highest Illinois Number be?&lt;p&gt;
To figure out, I simply built a tree with Illinois at the top, Ohio State below,
the teams that beat Ohio State below that, and so on until all the games that
fit into the tree had been properly placed. I've posted the basic code to
construct the tree &lt;a 
href="http://billmill.org/static/files/gen_illinois_tree.py"&gt;here&lt;/a&gt;; it
requires the &lt;a href="http://kenpom.com/cbbga.txt"&gt;data&lt;/a&gt; I mentioned
earlier. After some massaging of the results, I threw up a quick CGI app to
explore them &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.html"&gt;here&lt;/a&gt;.&lt;p&gt;
&lt;h2&gt;Conclusion&lt;/h2&gt;&lt;p&gt;
There's no deep conclusion; I just thought it was a neat little script, so I
wanted to post it. There are a few interesting thoughts I had on the data,
though:
&lt;ul&gt;&lt;li&gt;The highest Illinois Number is 7, and it belongs to the 
little Division III squad of &lt;a
href="http://www.hsc.edu/athletics/basketball/"&gt;Hampden Sydney&lt;/a&gt;, an 
all-men's school in Virginia. They &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=hampden+sydney"&gt;beat&lt;/a&gt;
the &lt;a
href="http://www.longwoodlancers.com/index.asp?path=mbasket"&gt;Longwood
Lancers&lt;/a&gt;, who managed only one win all season.
&lt;li&gt;Both Division I one-win teams, &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=army"&gt;Army&lt;/a&gt;
and the aforementioned &lt;a
href="http://llimllib.f2o.org/illinois_number/illinois.py?team=longwood"&gt;Longwood&lt;/a&gt;,
had Illinois Numbers of 6
&lt;li&gt;As far as I can tell, the only Division I school without an Illinois Number
is lowly Savannah State, who went 0-28 this season.
&lt;li&gt;For this type of task - parsing text, building a relatively small tree, and
playing with it - Python is the best. I love playing with data in the
interpreter, coding very very high-level data structures in minutes, and trying
things out as fast as I can type them.&lt;p&gt;&lt;/ul&gt;
&lt;a name="illinois_txt_foot1"&gt;[1]: Let me mention that, if you're a stats geek,
you should have been reading &lt;a href="http://kenpom.com"&gt;Ken's site&lt;/a&gt; for a
while. Go there for his interesting custom college basketball ranking system,
stay for the game analysis on his blog.&lt;p&gt;
[2]: The data analyzed here does not include tonight's North Carolina game,
since I coded everything this afternoon. Tonight was the first time I was 
rooting for a sports team just to keep my blog consistent, but NC's defense was just too good for the Fightin' Illini.
</content>
	</entry>
</feed>
