Extracting information from a website in Ruby

We want to integrate news items from third party news web sites into the news feed of our own web site (with proper attribution of the source, of course) and link them to the original source. For this we need to save at least the following attributes of an item: <ul> <li>Title</li> <li>Short Description</li> <li>Link to the original source</li> <li>(optional) Image</li> </ul> This information extraction should be done in Ruby. The process should be explained for every generic news website, but for a first solution the news website suffices.
Subscribe to Ruby