Module xml_utils
XML parsing utilities for PMC parser
This module re-exports common XML parsing utilities.
The actual implementation has been moved to crate::common::xml_utils
for sharing between PubMed and PMC parsers.
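The re-export arrangement described above can be sketched in a single file. This is only an illustration of the pattern, not the crate's source: the module name pmc_xml_utils, the nesting, and the body and signature of is_self_closing_tag are assumptions made for the example.

```rust
// Stand-in for `crate::common::xml_utils`, where the shared code lives.
mod common {
    pub mod xml_utils {
        /// Hypothetical helper; the real signature and logic are not shown
        /// in this page and may differ.
        pub fn is_self_closing_tag(tag: &str) -> bool {
            tag.trim_end().ends_with("/>")
        }
    }
}

// Stand-in for the PMC-side module (name assumed): it holds no logic of its
// own and only re-exports the shared items, so existing call sites that
// import from the PMC path keep compiling.
mod pmc_xml_utils {
    pub use super::common::xml_utils::*;
}
```

With this layout, pmc_xml_utils::is_self_closing_tag and common::xml_utils::is_self_closing_tag name the same function, so both parsers share one implementation.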
Functions
- decode_xml_entities - Decode XML character entities in a string
- extract_all_attributes - Extract all attributes from an XML tag
- extract_all_text_between - Extract content between tags for all occurrences
- extract_attribute_value - Extract attribute value from XML tag
- extract_element_content - Extract element content with its tag name
- extract_section_text - Extract text content from a section, handling nested tags
- extract_text_between - Extract text between two XML tags
- extract_text_between_ref - Extract text between two XML tags as a borrowed string slice
- find_all_tags - Find all occurrences of a tag in content
- is_self_closing_tag - Check if a tag is self-closing
- strip_inline_html_tags - Strip inline HTML-like formatting tags from XML content
- strip_xml_tags - Strip XML tags from content
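
To give a feel for the kind of string-level operations listed above, here is a minimal sketch of three of them. These are not the crate's functions: the names carry a _sketch suffix to mark them as hypothetical, the signatures are assumptions, and the real implementations may handle more cases (numeric entities, CDATA, attributes containing '>').

```rust
/// Decode the five predefined XML entities. `&amp;` is replaced last so that
/// an escaped entity such as `&amp;lt;` decodes to `&lt;` and not to `<`.
/// (Assumed behavior; numeric references like `&#169;` are not handled.)
fn decode_entities_sketch(input: &str) -> String {
    input
        .replace("&lt;", "<")
        .replace("&gt;", ">")
        .replace("&quot;", "\"")
        .replace("&apos;", "'")
        .replace("&amp;", "&")
}

/// Return the text between the first `start` marker and the next `end`
/// marker as a borrowed slice, or `None` if either marker is missing.
fn text_between_sketch<'a>(content: &'a str, start: &str, end: &str) -> Option<&'a str> {
    let from = content.find(start)? + start.len();
    let len = content[from..].find(end)?;
    Some(&content[from..from + len])
}

/// Drop everything between `<` and the following `>`, keeping the text.
fn strip_tags_sketch(content: &str) -> String {
    let mut out = String::with_capacity(content.len());
    let mut in_tag = false;
    for ch in content.chars() {
        match ch {
            '<' => in_tag = true,
            '>' if in_tag => in_tag = false,
            _ if !in_tag => out.push(ch),
            _ => {} // character inside a tag: skip it
        }
    }
    out
}
```

Returning a borrowed slice, as in text_between_sketch, avoids an allocation when the caller only needs to inspect the text. Presumably that is the motivation for having both an owned and a _ref variant in the list above, though this page does not say so.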