Folder
Last updated
Last updated
The Folder Loader provides functionality to load and process multiple files from a directory. This module supports a wide range of file formats and can recursively process subdirectories.
This module provides a sophisticated folder loader that can:
Load multiple file types simultaneously
Process directories recursively
Handle various document formats
Support PDF-specific processing
Process structured data files
Customize metadata extraction
Support text splitting
Folder Path: Path to the directory containing files
Recursive: Whether to process subdirectories
Text Splitter: A text splitter to process the extracted content
PDF Usage: Choose between:
One document per page
One document per file
JSONL Pointer Extraction: Pointer name for JSONL files
Additional Metadata: JSON object with additional metadata
Omit Metadata Keys: Comma-separated list of metadata keys to omit
Document: Array of document objects containing metadata and pageContent
Text: Concatenated string from pageContent of documents
PDF (.pdf)
Word (.doc, .docx)
Excel (.xls, .xlsx, .xlsm, .xlsb)
PowerPoint (.ppt, .pptx)
Text (.txt)
Markdown (.md, .markdown)
HTML (.html)
XML (.xml)
JSON (.json)
JSONL (.jsonl)
CSV (.csv)
Python (.py, .python)
JavaScript (.js)
TypeScript (.ts)
Java (.java)
C/C++ (.c, .cpp, .h)
C# (.cs)
Ruby (.rb, .ruby)
Go (.go)
PHP (.php)
Swift (.swift)
Rust (.rs)
Scala (.scala, .sc)
Kotlin (.kt)
Solidity (.sol)
CSS (.css)
SCSS (.scss)
LESS (.less)
SQL (.sql)
Protocol Buffers (.proto)
Multi-format support
Recursive directory processing
PDF processing options
Structured data handling
Text splitting support
Metadata customization
Error handling
Automatically detects file types
Handles large directories
Preserves file metadata
Memory-efficient processing
Supports custom file extensions
Error handling for invalid files
Flexible output formats