20 January 2005

XML Digger: Tunneling into buried feeds and web content to post to XML-search tools


Creative Commons License

Link to this content and cite as "Mud's Tests." This work is licensed under a Creative Commons License.

Creative Commons License

Link to this content and cite as "Mud's Tests." This work is licensed under a Creative Commons License.
-->

Summary


Details on concept to dig into non-pinged XML content, then allow others to find, archive, and search information. Converts non-XML content into XML-searchable content.

This approach focuses only on the content-end, and ignores the understandable limitations of XML-related search tools that continue to modernize as XML evolves.


Problem description


There seems to be a recurring problem with ancient archives not showing up in XML search tools. Either:

  • content is not pinged; or

  • the content is so far buried, that it remains beyond the reach of tools and search tools; or

  • the content is part of a package of multiple blog entries that get caught in an avalanche of posts, which the search tools and XML systems interpret as a "single update" and only take the first blog entry.

    Whatever the cause of the problem, content doesn't show up in a search tool.


    Suggestion

    Tool that uses XML-search to find content that has not been specifically pinged, but exists on the platform providing content.

    Finds:

    1. Non-XML content
    2. Content not pinged

    Converts non-XML-information into XML-accessible; then presents it for access and search.


    Features

  • Non-XML content able to be retroactively searchable

  • Find stuff outside the feed

  • Use an XML-focused tool to not dig into:
    - XML pinged-sites;
    - finds original content "not-archived in XML-search tools via Pings"

    Creative Commons License

    Link to this content and cite as "Mud's Tests." This work is licensed under a Creative Commons License.

  • Summary


    Details on concept to dig into non-pinged XML content, then allow others to find, archive, and search information. Converts non-XML content into XML-searchable content.

    This approach focuses only on the content-end, and ignores the understandable limitations of XML-related search tools that continue to modernize as XML evolves.


    Problem description


    There seems to be a recurring problem with ancient archives not showing up in XML search tools. Either:

  • content is not pinged; or

  • the content is so far buried, that it remains beyond the reach of tools and search tools; or

  • the content is part of a package of multiple blog entries that get caught in an avalanche of posts, which the search tools and XML systems interpret as a "single update" and only take the first blog entry.

    Whatever the cause of the problem, content doesn't show up in a search tool.


    Suggestion

    Tool that uses XML-search to find content that has not been specifically pinged, but exists on the platform providing content.

    Finds:

    1. Non-XML content
    2. Content not pinged

    Converts non-XML-information into XML-accessible; then presents it for access and search.


    Features

  • Non-XML content able to be retroactively searchable

  • Find stuff outside the feed

  • Use an XML-focused tool to not dig into:
    - XML pinged-sites;
    - finds original content "not-archived in XML-search tools via Pings"

    Creative Commons License

    Link to this content and cite as "Mud's Tests." This work is licensed under a Creative Commons License.
    " />