Module parser

Module parser 

Expand description

PubMed XML parser module

This module provides functionality for parsing PubMed EFetch XML responses into structured article metadata. The parser handles complex XML structures including authors, affiliations, MeSH terms, chemicals, and structured abstracts.

§Module Organization

  • preprocessing - XML cleaning and preparation
  • deserializers - Custom serde deserializers for complex fields
  • extractors - Data extraction utilities (email, country, names)
  • xml_types - Internal XML schema deserialization types
  • converters - Conversion from XML types to public API models

§Public API

The main entry point is parse_article_from_xml, which takes a PubMed EFetch XML response and returns a PubMedArticle.

Functions§

parse_article_from_xml
Parse article from EFetch XML response
parse_articles_from_xml
Parse multiple PubMed articles from EFetch XML response