PDF Files
Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. The Pdf File module decodes the base64-encoded data from the PDF document and then loads the PDF content. If a textSplitter is provided, it uses it to split the text content.
Inputs
Text Splitter (optional) PDF File Usage One Document per Page OR One Document per File
Output
loads PDF content
Last updated