PDF Files

Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems.

The PDF File module decodes the base64-encoded data of the uploaded PDF document and then loads the PDF content. If a text splitter (textSplitter) is provided, the module uses it to split the loaded text content into chunks.
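The decoding step described above can be sketched as follows. This is a minimal illustration, not the module's actual implementation: the function name decode_pdf_data is hypothetical, and the handling of a "data:...;base64," prefix is an assumption about how the upload might be encoded. Text extraction from the decoded bytes would be delegated to a PDF parsing library and is not shown.

```python
import base64


def decode_pdf_data(data: str) -> bytes:
    """Decode a base64 string into raw PDF bytes (hypothetical helper)."""
    # Assumption: the input may carry a data-URI prefix such as
    # "data:application/pdf;base64,". If so, keep only the payload.
    if data.startswith("data:") and "," in data:
        data = data.split(",", 1)[1]

    raw = base64.b64decode(data)

    # PDF files start with the "%PDF-" signature; reject anything else early.
    if not raw.startswith(b"%PDF-"):
        raise ValueError("decoded data does not look like a PDF")
    return raw
```

Checking the signature before parsing gives a clear error for a wrong or corrupted upload instead of a failure deep inside the PDF parser.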

Inputs

Text Splitter (optional)
PDF File
Usage: One Document per Page OR One Document per File
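To illustrate how the Usage option and the optional Text Splitter might interact, here is a small sketch. It assumes the page texts have already been extracted from the PDF; the Document class, the build_documents function, the "perPage" and "perFile" values, and fixed_size_splitter are all hypothetical names chosen for this example and are not taken from the module.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Document:
    """Hypothetical container for a piece of text and its metadata."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def build_documents(
    pages: List[str],
    usage: str = "perPage",
    text_splitter: Optional[Callable[[str], List[str]]] = None,
) -> List[Document]:
    """Turn extracted page texts into documents, optionally split into chunks."""
    if usage == "perPage":
        # One document per page, with the 1-based page number as metadata.
        docs = [Document(text, {"page": i}) for i, text in enumerate(pages, start=1)]
    elif usage == "perFile":
        # One document for the whole file, pages joined by blank lines.
        docs = [Document("\n\n".join(pages))]
    else:
        raise ValueError(f"unknown usage: {usage!r}")

    if text_splitter is None:
        return docs

    # Each chunk inherits a copy of the metadata of the document it came from.
    return [
        Document(chunk, dict(doc.metadata))
        for doc in docs
        for chunk in text_splitter(doc.page_content)
    ]


def fixed_size_splitter(chunk_size: int) -> Callable[[str], List[str]]:
    """Return a naive splitter that cuts text into fixed-size character chunks."""
    def split(text: str) -> List[str]:
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    return split
```

For example, build_documents(pages, "perFile", fixed_size_splitter(1000)) would merge all pages into one text and then cut it into chunks of at most 1000 characters. A real text splitter would usually respect sentence or paragraph boundaries rather than cutting at fixed offsets.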

Output

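The loaded PDF content, as one document per page or one document per file depending on the Usage setting, split further if a Text Splitter is provided.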
