Jan 22

HTML to RSS Conversion?

in Techie :: Techno

For a variety of reasons I’ve been trying to find a simple way of converting plain HTML pages into RSS.
Does anyone know of a script that can do this?
It doesn’t matter which platform the script runs on, as long as it isn’t Apple Mac (as I don’t have access to any).
Obviously I’d prefer it if the script was Linux-based, i.e. PHP, Perl or Python, but I’m not averse to using ASP if it does the job.

8 Responses to “HTML to RSS Conversion?”

  1. From Kae Verens:

    here is some code which I wrote for http://contemporaryliving.ie/ – see http://webmecms.org/common/funcs.blogs.phps
    note that it’s not as straightforward as “html2rss($htmlurl)”, but it should hopefully help.

    Posted on January 22, 2006 at 9:23 pm #
  2. From michele:

    Kae
    Thanks.
    So how do you call it? :)

    Posted on January 22, 2006 at 11:08 pm #
  3. From Kae Verens:

    hehe – see that’s the thing – that’s a function that builds a webpage from a database entry, then converts that to RSS. You most likely do not use the same db schema, so the code will not be immediately useful to you. But get one of your code monkeys to throw an eye over it and they’ll have something up and running in no time.

    Posted on January 22, 2006 at 11:18 pm #
  4. From michele:

    Kae – code monkeys? Oh dear !! I’ll tell them :P
    Not sure how they’ll like that though
    As you rightly guessed I don’t have the content in a database.. It’s all plain old-fashioned HTML
    Thanks anyway..
    Michele

    Posted on January 22, 2006 at 11:19 pm #
  5. From Rob:

    Assuming the HTML is fairly regular, it should be quite easy to write such a thing. Loop through pages, extract any metadata you want to use in the RSS, and you’re done.

    Posted on January 23, 2006 at 1:05 pm #
  6. From michele:

    Rob – Since I am not a coder, that would be like telling me that climbing Everest isn’t an issue.

    Posted on January 23, 2006 at 1:24 pm #
  7. From Rob:

    What’s the form of the HTML files? It might be literally a few lines.

    Posted on January 23, 2006 at 1:31 pm #
  8. From michele:

    Rob – I’ve emailed you with a link to it
    M

    Posted on January 23, 2006 at 1:50 pm #
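
For anyone landing here with the same question, Rob’s suggestion in comment 5 (loop through the pages, extract the metadata, write it out as RSS) might look roughly like the sketch below. It is only an illustration, not the script that was eventually used: it assumes each page has a <title> and optionally a <meta name="description">, and the names TitleAndDescription and html_to_rss are made up for this example. It uses only the Python standard library.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class TitleAndDescription(HTMLParser):
    """Collect the <title> text and the <meta name="description"> content."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Only text that appears between <title> and </title> is kept.
        if self._in_title:
            self.title += data


def html_to_rss(pages, feed_title, feed_link):
    """Build an RSS 2.0 document from an iterable of (url, html_text) pairs."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = feed_link
    # Assumption: reuse the feed title as the channel description.
    ET.SubElement(channel, "description").text = feed_title
    for url, html_text in pages:
        parser = TitleAndDescription()
        parser.feed(html_text)
        parser.close()
        item = ET.SubElement(channel, "item")
        # Fall back to the URL when a page has no usable title.
        ET.SubElement(item, "title").text = parser.title.strip() or url
        ET.SubElement(item, "link").text = url
        ET.SubElement(item, "description").text = parser.description
    return ET.tostring(rss, encoding="unicode")
```

To use it, read each HTML file into a string, pair it with the address it is published at, and pass the list to html_to_rss along with a feed title and link of your choosing; the returned string can be saved as an .xml file. Pages whose useful content lives somewhere other than the title and description would need a different parser class.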