HaloJones wrote:Can you detail the specific problem you have with the downloaded files and maybe someone here can host them for you as a relay to avoid the issue?
I can download the files from my web server in San Diego, California, where I keep those files backups. That is a ssh download. Although very slow, they arrive without errors. I think I have a problem with everything from California as I can watch youtube videos and download other content at the connection full speed.
The speed would be just an inconvenience if it weren't by the corrupted files. It looks like something changed about the Stanford's http server configuration.
I will have to change the KakaoStats code to download from the web server but that will obviously require a coding session. That is because the files are not http exposed as the current download code requires. I will need to either expose then or change the current code to make a ssh download. I can only start it on the weekend.
Tracing route to vspm27.stanford.edu [171.65.103.94]
over a maximum of 30 hops:
1 <1 ms <1 ms <1 ms [router]
2 <1 ms <1 ms <1 ms [router]
3 7 ms 8 ms 7 ms d54C3F001.access.telenet.be [84.195.240.1]
4 9 ms 9 ms 9 ms dD5E0C521.access.telenet.be [213.224.197.33]
5 8 ms 10 ms 8 ms dD5E0FD59.access.telenet.be [213.224.253.89]
6 8 ms 7 ms 8 ms dD5E0FDB9.access.telenet.be [213.224.253.185]
7 10 ms 10 ms 9 ms ae0.anr11.ip4.tinet.net [77.67.65.177]
8 22 ms 42 ms 19 ms xe-1-0-0.lon11.ip4.tinet.net [89.149.185.170]
9 21 ms 25 ms 35 ms te7-6.mpd02.lon01.atlas.cogentco.com [130.117.15.49]
10 98 ms 96 ms 96 ms te0-0-0-1.mpd21.jfk02.atlas.cogentco.com [154.54.30.129]
11 141 ms 218 ms 211 ms te1-8.mpd01.ord01.atlas.cogentco.com [154.54.29.157]
12 168 ms 168 ms 163 ms te0-1-0-2.mpd21.mci01.atlas.cogentco.com [66.28.4.185]
13 292 ms 215 ms 206 ms te2-2.mpd01.sfo01.atlas.cogentco.com [154.54.6.38]
14 185 ms 215 ms 203 ms te7-4.mpd01.sjc04.atlas.cogentco.com [154.54.28.82]
15 170 ms 172 ms 170 ms Stanford_University2.demarc.cogentco.com [66.250.7.138]
16 172 ms 175 ms 176 ms boundarya-rtr.Stanford.EDU [68.65.168.33]
17 * * * Request timed out.
18 * * * Request timed out.
19 * * * Request timed out.
20 171 ms 171 ms 180 ms vspm27.Stanford.EDU [171.65.103.94]
Trace complete.
Tracing route to vspm27.stanford.edu [171.65.103.94]
over a maximum of 30 hops:
1 1 ms 1 ms 1 ms [router]
2 2 ms 1 ms 1 ms in-int.PPPoE2.escom.bg [195.24.89.9]
3 2 ms 1 ms 1 ms 195.24.88.2
4 5 ms 5 ms 5 ms 94.156.248.121
5 9 ms 5 ms 5 ms ae0-431.sof10.ip4.tinet.net [77.67.67.113]
6 42 ms 41 ms 41 ms xe-10-0-0.fra23.ip4.tinet.net [89.149.186.250]
7 56 ms 59 ms 41 ms te1-7.mpd02.fra03.atlas.cogentco.com [130.117.14.85]
8 137 ms 138 ms 137 ms te2-4.mpd01.ymq02.atlas.cogentco.com [154.54.28.93]
9 152 ms 152 ms 152 ms te8-7.mpd02.ord01.atlas.cogentco.com [66.28.4.57]
10 173 ms 174 ms 174 ms te0-4-0-0.mpd22.mci01.atlas.cogentco.com [154.54.30.178]
11 204 ms 204 ms 204 ms te4-7.mpd01.sfo01.atlas.cogentco.com [154.54.24.105]
12 216 ms 216 ms 216 ms te4-2.mpd01.sjc04.atlas.cogentco.com [154.54.2.166]
13 216 ms 232 ms 216 ms Stanford_University2.demarc.cogentco.com [66.250.7.138]
14 216 ms 214 ms 211 ms bnda-rtr-1.Stanford.EDU [68.65.168.33]
15 * * * Request timed out.
16 * * * Request timed out.
17 * * * Request timed out.
18 221 ms 210 ms 211 ms vspm27.Stanford.EDU [171.65.103.94]
Trace complete.
Kakao: I just wanted to express my gratitude for your stats page. It's a very simple but effective design for conveying a lot of information. Thanks in advance for your work in making it operational again!!!
I managed to implement a half solution. First I exposed the backup files at the web server through http. Then I pointed the download script to them. But the problems persisted showing that the problem is not with the Stanford's http server. Then I tried the hardened download script rewritten for the development version of KakaoStats and it proved to be more resilient to http connection problems successfully downloading the last few updates.
So the problem is not solved. I just have a more resilient script working and the download is very slow. My issue is very likely a routing problem which I will investigate during the next days.
There are many updates still missing in the database but I will manually feed then until the weekend.
No. There is no problem from San Diego to San Jose. I can get the files extremely fast when downloading from San Jose to San Diego. The problem is downloading to Brasília, Brazil from both San Diego and San Jose. That is where the routing problem is. There is no problem getting content from anywhere else to Brasília.
Kakao wrote:No. There is no problem from San Diego to San Jose. I can get the files extremely fast when downloading from San Jose to San Diego. The problem is downloading to Brasília, Brazil from both San Diego and San Jose. That is where the routing problem is. There is no problem getting content from anywhere else to Brasília.
.
Why, then, does a traceroute to kakaostats.com guide me to a server in the San Diego area ?
.
- stopped Linux SMP w. HT on [email protected] GHz
....................................
Folded since 10-06-04 till 09-2010
The web side of KakaoStats lives on the web server in San Diego. That is where the pages are built. The data to build the pages live in the database server in Brasília. So whenever a page is built by the web server it sends a sql query to the database. Actually there is lots of caching so the queries are much less frequent.
A database server is a much more demanding software than a web server. It would be costly to rent a machine in a data center to run the database. So it sits right in my house.
The machine in my house has 8GB memory, two HDs in Raid 1. Nothing fancy. But in the data center they charge every bit you add to the base machine.
Last edited by Kakao on Thu Apr 15, 2010 6:03 pm, edited 1 time in total.