I need to get a new stats program that accurately reports successful downloads of binaries.
The default webalizer is fine for pages and images, but because of byte serving of large files it’s an almost impossible task to determine from the logs exactly how many times a large PDF was downloaded. For a given user-session-IP, one can see as many as 20-30 GET requests for the same document. one could write (and I have, but I don’t trust it at all) an analyzer to aggregate those and correlate them to a 1 successful download. But then we also have the well known issue of browsers hanging where someone is still silly enough to use the PDF web plug-in (installed by default in most IE defaults) which tries to open a single PDF Acrobat thru a Web browser window. Typically, this has been known to fail in cases where the PDF is over 5MB in size. But, who knows if the browser does not continue to issue GET requests for the whole document, even as the user experience is that his system hung? I am unable to see from the logs any cases of failed downloads, which is “insane” because there has to be at least 1 or 2 such cases.
We typically serve large PDFs, running from 18 MG to 48 MB in size. I have some desktop client agents that download the file in one go and I can see these in the webalizer logs as a single GET request. But all browser requests are byte served in parts. XOOPS framework has the number of downloads, but these only register where the user actually gets the PDF through that GUI. But the PDF’s are linking in many other locations, pages and other sites such that a large number of successful hits will not appear in the XOOPS downloads count. Google Urchin, does not do binaries… So, bottom line: we are completely in the dark when someone asks “How many successful downloads of MyIncredibleDocISpentMonthsMaking.pdf were there this month?” and soon we will be also deliver free.epub (=.zip) files and I expect to have the same issue (I don’t know if .zip are served in parts or not)
Are there an stats packages/solutions for this already available that we an install?
Any insights will be much appreciated.
Thanks!