Skip to content

Changelog

Changelog

1.0.3 (2026-03-27)

Bug Fixes

  • address security and code quality issues (#14, #37, #59, #63, #64) (4845e67)
  • address security and code quality issues (#14, #37, #59, #63, #64) (9c9fb18)
  • bump pypdf >=6.9.1 to address CVE-2026-33123 DoS vulnerability (#55) (b9e2a8c)
  • strengthen type annotations and simplify docling availability check (00f951b)
  • strengthen type annotations and simplify docling check (86dfa39)

Documentation

  • sync documentation with codebase (f797c3d)

1.0.2 (2026-03-20)

Bug Fixes

  • eliminate hidden mutation side-effect in _match_image_ref (#41) (e6dc690)

1.0.1 (2026-03-17)

Bug Fixes

  • add upper-bound version pins for markitdown and docling optional deps (ef63111), closes #30
  • add upper-bound version pins for markitdown and docling optional… (4471a29)
  • bump Pillow to >=12.1,<13 to address CVE-2026-25990 (#29) (ee2cc8e)
  • move stdlib xml.etree.ElementTree import to TYPE_CHECKING block (#17) (829cea3)
  • regenerate pixi.lock in release-please PR (c4f6c76)
  • regenerate pixi.lock in release-please PR (1ea1ccb)

1.0.0 (2026-03-12)

  • feat: embedded media extraction for PDF, DOCX, PPTX (per-document media folders with wikilinks)
  • feat: config_for_backend() convenience API for quick single-backend configuration
  • feat: MediaConfig for image extraction settings (format, max dimension, enable/disable)
  • deps: added Pillow>=10.0,<12
  • BREAKING: ImportConfig requires media: MediaConfig field
  • BREAKING: backend extract() returns ExtractionResult (with .markdown and .media_files) instead of str

0.2.0 (2026-03-10)

  • Native backends for CSV, JSON, YAML, and image files
  • Image embedding: generates Obsidian ![[filename]] wikilinks and copies source images to vault
  • Pass-through mode: copy files as-is without extraction (configurable by extension, glob, regex)
  • Per-extension backend configuration (backends.csv, backends.json, backends.yaml, backends.image)
  • OutputConflictError exception for destination file conflicts

0.1.0 (2026-03-09)

Initial release.

  • Native backends: PDF (pdfplumber+pypdf), DOCX (defusedxml), PPTX (python-pptx), XLSX (openpyxl)
  • Optional backends: markitdown (fallback), docling (high-quality)
  • Config-driven backend selection per file type
  • Glob-based file discovery with exclude patterns
  • Obsidian-flavored markdown output with YAML frontmatter
  • Click CLI: convert, discover, batch, doctor
  • YAML configuration with deep-merge defaults