Cid Font F1 F2 F3 F4 Better Here

/F1 /CIDFontType0

import fitz # PyMuPDF doc = fitz.open("bad_fonts.pdf") for page in doc: for block in page.get_text("dict")["blocks"]: for line in block["lines"]: for span in line["spans"]: if span["font"].startswith(("F1","F2","F3","F4")): print(f"Found CID alias span['font'] at span['bbox']") # Fix: Re-encode page or extract text manually doc.close() cid font f1 f2 f3 f4 better

pdffonts yourfile.pdf Look for the "Type" column: CIDFontType0 or CIDFontType2 . Then inspect the "CMAP" column. If you see Identity-H but the language is Japanese, no direct conversion is possible without a custom CMAP. /F1 /CIDFontType0 import fitz # PyMuPDF doc = fitz