Archive for January, 2007

Why Did Google Buy YouTube?

January 25th, 2007

It seems like one of the major reasons is that they just wanted to improve their video search. With YouTube included in the search results, people might actually start using Google Video search.


America and Terror in the State of the Union Address

January 25th, 2007

So I am now working at Swivel part time, finding, compiling, and uploading interesting data. My new coworker Seema had the idea to graph the occurrences of certain words in State of the Union Addresses over time. Here are two of the most interesting graphs.
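For anyone who wants to try something similar, here is a minimal sketch of how the per-year word counts behind such a graph could be tallied. It is not the code used for the Swivel graphs; the function name `word_counts_by_year` and the sample texts are made up for illustration, and the texts are placeholders rather than real excerpts from any address.

```python
import re
from collections import Counter


def word_counts_by_year(speeches, words):
    """Count case-insensitive whole-word occurrences of each word per year.

    speeches: dict mapping year -> full text of that year's address.
    words: the words to track (hypothetical choices, e.g. "America", "terror").
    """
    targets = [w.lower() for w in words]
    result = {}
    for year, text in sorted(speeches.items()):
        # Split the lowercased text into simple word tokens.
        tokens = Counter(re.findall(r"[a-z']+", text.lower()))
        result[year] = {w: tokens[w] for w in targets}
    return result


# Placeholder texts for illustration only, not real speech excerpts.
sample_speeches = {
    2001: "America is strong. America will endure.",
    2002: "Terror will not stand. America will answer terror.",
}

counts = word_counts_by_year(sample_speeches, ["America", "terror"])
for year, per_word in counts.items():
    print(year, per_word)
```

The resulting year-by-word counts are the kind of table that could then be uploaded and graphed over time.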


The Effect of Age on Search Engine Rankings

January 24th, 2007

I did some analysis on the effect of domain name age on search engine ranking. I ran the top 300 searches according to Wordtracker through Google, MSN, and Yahoo. I checked the age of each site according to Archive.org, and here are the results. I just wish I had done months instead of […]


Google Analytics Full Referrer URL

January 15th, 2007

One thing that really annoys me about some visitor tracking software is that it truncates the referring URL at the query string (i.e., someforum.com/post.php?id=354353 shows up as just someforum.com/post.php). This makes it a huge hassle to find the specific referring page that your users are coming from.
Google Analytics is a good example of tracking software that […]
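
To make the truncation problem concrete, here is a small sketch using Python's standard `urllib.parse`. It only illustrates what is lost when a referrer is cut at the query string; it is not how any particular tracking package works, and the helper name `truncate_at_query` is made up for this example.

```python
from urllib.parse import parse_qs, urlsplit, urlunsplit


def truncate_at_query(url):
    """Drop the query string and fragment, as some trackers appear to do."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


# Example referrer from the post, with an assumed http:// scheme added.
full_referrer = "http://someforum.com/post.php?id=354353"

truncated = truncate_at_query(full_referrer)
query_params = parse_qs(urlsplit(full_referrer).query)

print(truncated)     # the page script only, the specific thread is lost
print(query_params)  # the part that identifies the specific thread
```

With only the truncated form, every thread on the forum collapses into the same referring page, which is why the full referrer is worth keeping.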


Top 90 Most Dugg Comments

January 15th, 2007

Here are some more statistics… This one is an update of this digg.

rank | comment | user | +/-

1 | Let’s set a new record, bitches. Digg me down. | choicetoes | 1184

link to digg?

2 | I don’t get it. Every time I click that link, it comes right back to here. But when I hover over it, the link is different… Lemme try it in IE… Nope! GODDAMNIT! Lemme download […]


Top 1001 Duplicate Digg Comments

January 14th, 2007

I was not really satisfied with my previous post on duplicate digg comments and decided to fix my code and generate a better list. I also updated the user comment database with my new data. Just a reminder that this data is from front page stories from the last 365 days. This list was generated […]


Digg Comment Data

January 14th, 2007

Some people were interested in downloading a copy of the digg.com comment data used in these two posts. So I fixed a few bugs in my spider code, and now the data is over a gigabyte uncompressed and contains over 4 million comments. The compressed file is about 340 MB. I know a lot of […]


Amazon Germany and Amazon Japan Filler Item Finders

January 13th, 2007

I have just finished up creating the German and Japanese versions of my Amazon Filler Item Finder website. Here are the Amazon.de Filler Item Finder and the Amazon.jp Filler Item Finder. These links will be fairly useless unless you speak one of these languages and reside in one of these countries, but I […]


Digg User Comment Statistics

January 11th, 2007

Here are some user statistics taken from the comments on the top ~30,000 articles of the last year (same data as before). For the users not on one of these lists, you can check your digg commenting statistics. The best/worst comment tables only list users with at least 10 comments.
A tab-delimited text file of the […]


Top 100 Duplicate Digg Comments from 2006

January 9th, 2007

UPDATE: Top 1001 Duplicate Digg Comments
With all of the “But will it blend” and “Pic it or it didnt happen” comments on digg.com, I thought it would be interesting to tally up the duplicate comments from the last year. My script ran through 1,255,627 comments from the ~30,000 most popular stories of last year. […]
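
For the curious, here is a minimal sketch of one way a duplicate tally like this could be done. It is not the script used for the post: the names `normalize` and `top_duplicates` are hypothetical, and treating comments as duplicates when they match after lowercasing and collapsing whitespace is an assumption made for this example.

```python
from collections import Counter


def normalize(comment):
    """Lowercase and collapse whitespace (an assumed matching rule)."""
    return " ".join(comment.lower().split())


def top_duplicates(comments, n=100):
    """Return up to n (text, count) pairs for comments seen more than once."""
    counts = Counter(normalize(c) for c in comments)
    repeated = [(text, k) for text, k in counts.most_common() if k > 1]
    return repeated[:n]


# Tiny made-up sample, not taken from the real comment data.
sample_comments = [
    "But will it blend?",
    "but will it  blend?",
    "First!",
    "Pic it or it didnt happen",
]

duplicates = top_duplicates(sample_comments)
for text, count in duplicates:
    print(count, text)
```

A stricter or looser matching rule (exact match only, or stripping punctuation as well) would change which comments count as duplicates.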