Module xml_utils
Common XML parsing utilities shared between the PubMed and PMC parsers.
This module provides reusable XML parsing functions for both string-based and serde-based XML parsing workflows.
Functions
- decode_xml_entities - Decode XML character entities in a string
- extract_all_attributes - Extract all attributes from an XML tag
- extract_all_text_between - Extract content between tags for all occurrences
- extract_attribute_value - Extract attribute value from XML tag
- extract_element_content - Extract element content with its tag name
- extract_section_text - Extract text content from a section, handling nested tags
- extract_text_between - Extract text between two XML tags
- extract_text_between_ref - Extract text between two XML tags as a borrowed string slice
- find_all_tags - Find all occurrences of a tag in content
- is_self_closing_tag - Check if a tag is self-closing
- strip_inline_html_tags - Strip inline HTML-like formatting tags from XML content
- strip_xml_tags - Strip XML tags from content
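
The string-based helpers listed above can be illustrated with a minimal, standalone sketch. This page does not show the module's signatures, so the parameter lists, return types, and bodies below are assumptions written for illustration only; they reuse three of the listed names but are not the module's actual implementation.

```rust
/// Hypothetical sketch: return the text between the first `start` marker and
/// the next `end` marker as a slice borrowed from `content`.
fn extract_text_between_ref<'a>(content: &'a str, start: &str, end: &str) -> Option<&'a str> {
    // Byte offset just past the opening marker.
    let from = content.find(start)? + start.len();
    // Length of the text up to the closing marker.
    let len = content[from..].find(end)?;
    Some(&content[from..from + len])
}

/// Hypothetical sketch: decode the five predefined XML entities.
/// Numeric character references such as `&#169;` are not handled here.
fn decode_xml_entities(text: &str) -> String {
    text.replace("&lt;", "<")
        .replace("&gt;", ">")
        .replace("&quot;", "\"")
        .replace("&apos;", "'")
        // Decode `&amp;` last so that "&amp;lt;" becomes "&lt;" rather than "<".
        .replace("&amp;", "&")
}

/// Hypothetical sketch: drop everything between '<' and '>'.
/// This is a naive scan; it does not special-case comments or CDATA sections.
fn strip_xml_tags(content: &str) -> String {
    let mut out = String::with_capacity(content.len());
    let mut in_tag = false;
    for ch in content.chars() {
        match ch {
            '<' => in_tag = true,
            '>' if in_tag => in_tag = false,
            _ if !in_tag => out.push(ch),
            _ => {}
        }
    }
    out
}
```

Under these assumptions, calling `extract_text_between_ref` on `<ArticleTitle>Tumor <i>necrosis</i> &amp; repair</ArticleTitle>` with the opening and closing `ArticleTitle` tags yields the inner markup without allocating. Passing that slice through `strip_xml_tags` and then `decode_xml_entities` produces plain text. Stripping before decoding keeps an escaped `&lt;` from being mistaken for the start of a tag.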