Subject: Re: LINKS
Date: Fri, 9 Apr 2004 13:35:54 -0400
To: ,
From: Bob Armstrong

Jak & Ron,

Sorry about the lost time. I don't know how I missed the previous email, but if it had a generic subject like "LINKS" and was from "Jacobs", I must have discarded it in the flood of spam. filters directly to my "people" file so it was captured.

Google apparently doesn't convert big documents like these reports. Last night, I found 2 methods for converting PDFs to HTML:

http://www.gohtm.com/ : free online conversion, emailed. (Only received files after prodding them this morning.) This service has a limit of 2MB files, so it appears to balk on the 2.17MB http://www.nyc.gov/html/omb/pdf/cb7_03.pdf file.

http://sourceforge.net/projects/pdftohtml/ : open source conversion tool.

(Adobe's own conversion service is uselessly crude.)

Neither of these conversion programs converts tables to HTML tables, which I had assumed they would. But I think there is enough structure using
and class= to automate extraction. It's just more work using utilities I've created for parsing tables.

This is the worst weekend to attack this because I have lots to get done by the 15th. I'd rather attack it after that. In any case, zeroing in on particular pages of interest would illuminate where to start.

Sorry again to have missed the first posting of the links. Let me know what's most important and when it's needed.

Bob A

--

On Thu, 8 Apr 2004 16:02:42 -0500, jak wrote:
> Bob
>
> I am sending an other email just in case you are not getting my emails from
> my other account. I did send out the links before right after our
> conversation last month.
>
> http://www.nyc.gov/html/omb/pdf/cbgeo7_03.pdf
> http://www.nyc.gov/html/omb/pdf/cb7_03.pdf
> http://www.nyc.gov/html/omb/pdf/erc7_03.pdf
>
> Jak
> 212 314 5640
> www.BailaTango.com/ny/
>
> to subscribe to regular email updates send an email to
> BailaTango-NY-subscribe@yahoogroups.com
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

--
Bob Armstrong -- http://CoSy.com -- 212-285-1864
Pix+ : .CoSy.MidWinterParty 17 : http://cosy.com/y04/MidWinter17.htm
Computing Environment : http://CoSy.com/CoSy/
A WTC vision : http://CoSy.com/CoSy/ConicAllConnect/
Libertarian Presidential Candidates : http://CoSy.com/Liberty.htm
Restore our Right to Relax : http://ny.lp.org/cgi-bin/petition.cgi?Against_the_Smoking_Ban
2004/04/08 8:31:09 PM