libearth.parser.atom — Atom parser¶
Parsing Atom feed. Atom specification is RFC 4287
-
libearth.parser.atom.ATOM_XMLNS_SET= frozenset(['http://purl.org/atom/ns#', 'http://www.w3.org/2005/Atom'])¶ (
frozenset) The set of XML namespaces for Atom format.
-
libearth.parser.atom.XML_XMLNS= 'http://www.w3.org/XML/1998/namespace'¶ (
str) The XML namespace for the predefinedxml:prefix.
-
libearth.parser.atom.parse_atom(xml, feed_url, parse_entry=True)¶ Atom parser. It parses the Atom XML and returns the feed data as internal representation.
Parameters: - xml (
str) – target atom xml to parse - feed_url (
str) – the url used to retrieve the atom feed. it will be the base url when there are any relative urls withoutxml:baseattribute - parse_entry (
bool) – whether to parse inner items as well. it’s useful to ignore items when retrieve<source>in rss 2.0.Trueby default.
Returns: a pair of (
Feed, crawler hint)Return type: - xml (