Ted Leung on the air
Ted Leung on the air: Open Source, Java, Python, and ...
Ted Leung on the air: Open Source, Java, Python, and ...
Thu, 10 Apr 2003
RSS quickies
Greg Reinacker's making the Windows Event Log available as RSS.
Jon Udell describes how targeted RSS feeds could help corporate PR flacks.
Chad Osgood describes Extensible Data Transform (XDT), a mechanism for converting wild data to RSS / XML. Think of it as XSLT that can use regexps to extract data from wonky (not-quite XML documents)
[11:33] |
[computers/internet/weblogs] |
# |
TB |
F |
G |
2 Comments |
Jon Udell describes how targeted RSS feeds could help corporate PR flacks.
Chad Osgood describes Extensible Data Transform (XDT), a mechanism for converting wild data to RSS / XML. Think of it as XSLT that can use regexps to extract data from wonky (not-quite XML documents)
Yes, RSS is a perfect target for screen scrapers. Here are feeds that scrape news from Amensty International and Greeanpeace with the python hacks to do it:
http://www.dehora.net/amnesty/amnesty-rss10.xml
http://www.dehora.net/amnesty/air.py
http://www.dehora.net/rss/greenpeace-rss10.xml
http://www.dehora.net/code/gpr.py
I think using a scripting language is more flexible for scraping. No doubt this is why Google use Python. For example if you look the greenpeace python, ugly as it is, you'll see follows a link to get a long description.
Posted by Bill de hÓra at Thu Apr 10 15:22:02 2003
http://www.dehora.net/amnesty/amnesty-rss10.xml
http://www.dehora.net/amnesty/air.py
http://www.dehora.net/rss/greenpeace-rss10.xml
http://www.dehora.net/code/gpr.py
I think using a scripting language is more flexible for scraping. No doubt this is why Google use Python. For example if you look the greenpeace python, ugly as it is, you'll see follows a link to get a long description.
Posted by Bill de hÓra at Thu Apr 10 15:22:02 2003
Greg Reinacker's making the Windows Event Log available as RSS.
Posted by Trackback from Coty's Radio Weblog at Tue Sep 23 07:56:48 2003
Posted by Trackback from Coty's Radio Weblog at Tue Sep 23 07:56:48 2003
You can subscribe to an RSS feed of the comments for this blog:
Add a comment here:
You can use some HTML tags in the comment text:
To insert a URI, just type it -- no need to write an anchor tag.
Allowable html tags are:
You can also use some Wiki style:
URI => [uri title]
<em> => _emphasized text_
<b> => *bold text*
Ordered list => consecutive lines starting spaces and an asterisk
To insert a URI, just type it -- no need to write an anchor tag.
Allowable html tags are:
<a href>
, <em>
, <i>
, <b>
, <blockquote>
, <br/>
, <p>
, <code>
, <pre>
, <cite>
, <sub>
and <sup>
.You can also use some Wiki style:
URI => [uri title]
<em> => _emphasized text_
<b> => *bold text*
Ordered list => consecutive lines starting spaces and an asterisk