This module provides functions to autodiscovery feed url in document.
(str) The MIME type of Atom format (application/atom+xml).
(str) The MIME type of RSS 2.0 format (application/rss+xml).
(collections.Mapping) The mapping table of feed types
Parse the given HTML and try finding the actual feed urls from it.
Changed in version 0.3.0: It became to find icon links as well, and find_feed_url() method (that returned only feed links) was gone, instead find() (that return a pair of feed links and icon links) was introduced.
Namedtuple which is a pair of type` and ``url
Alias for field number 0
Alias for field number 1
Exception raised when feed url cannot be found in html.
If the given url refers an actual feed, it returns the given url without any change.
If the given url is a url of an ordinary web page (i.e. text/html), it finds the urls of the corresponding feed. It returns feed urls in feed types’ lexicographical order.
If autodiscovery failed, it raise FeedUrlNotFoundError.
Parameters: | |
---|---|
Returns: | list of FeedLink objects |
Return type: | collections.MutableSequence |
Guess the syndication format of an arbitrary document.
Parameters: | document (str, bytes) – document string to guess |
---|---|
Returns: | the function possible to parse the given document |
Return type: | collections.Callable |
Changed in version 0.2.0: The function was in libearth.parser.heuristic module (which is removed now) before 0.2.0, but now it’s moved to libearth.parser.autodiscovery.