Plain Text

Plain text is the most basic form of text data, containing no formatting or other embedded information.
This module provides a straightforward document loader that takes plain text content directly as input. It can:
Load text content directly
Split the text into multiple documents with an optional text splitter
Add custom metadata and omit selected metadata keys
Handle escape characters
Manage text encoding
Inputs
Required Parameters
Text: The plain text content to process
Optional Parameters
Text Splitter: A text splitter to process the content
Additional Metadata: JSON object with additional metadata
Omit Metadata Keys: Comma-separated list of metadata keys to omit
Outputs
Document: Array of document objects containing metadata and pageContent
Text: Concatenated string from pageContent of documents
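The relationship between the two outputs can be sketched as follows. This is a minimal illustration, not the module's actual implementation: the function names load_plain_text and to_text are hypothetical, documents are modelled as plain dictionaries, and the newline used to join pageContent values is an assumption, since the separator is not specified here.

```python
import json
from typing import Any


def load_plain_text(text: str, additional_metadata: str = "") -> list[dict[str, Any]]:
    """Hypothetical loader: wrap the input text in a single document object."""
    # Additional Metadata is described as a JSON object, so parse it if present.
    metadata = json.loads(additional_metadata) if additional_metadata.strip() else {}
    return [{"pageContent": text, "metadata": metadata}]


def to_text(documents: list[dict[str, Any]]) -> str:
    """Hypothetical Text output: join pageContent values (separator assumed)."""
    return "\n".join(doc["pageContent"] for doc in documents)


docs = load_plain_text("Hello, plain text.", '{"source": "manual-entry"}')
print(docs)           # Document output: a list of document objects
print(to_text(docs))  # Text output: one concatenated string
```

With a single document and no splitter, the Text output is the same string as the input.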
Features
Direct text input
Text splitting support
Metadata handling
Error handling
Memory-efficient processing
Character encoding handling
Two output formats: Document array or concatenated Text
Text Processing
Direct Mode
Single document creation
Preserves original text
Basic metadata handling
Memory efficient
Split Mode
Multiple document creation
Custom splitting rules
Individual chunk metadata
Granular content access
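The difference between the two modes can be sketched as follows. The fixed-size splitter split_by_size and the chunk_index metadata key are hypothetical stand-ins chosen for illustration; a real text splitter may use different rules and attach different per-chunk metadata.

```python
from typing import Any, Callable, Optional


def split_by_size(text: str, chunk_size: int = 20) -> list[str]:
    """Hypothetical splitter: cut the text into fixed-size character chunks."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def build_documents(
    text: str,
    splitter: Optional[Callable[[str], list[str]]] = None,
) -> list[dict[str, Any]]:
    """Return one document (direct mode) or one per chunk (split mode)."""
    if splitter is None:
        # Direct mode: the original text is preserved in a single document.
        return [{"pageContent": text, "metadata": {}}]
    # Split mode: each chunk becomes its own document with its own metadata.
    return [
        {"pageContent": chunk, "metadata": {"chunk_index": index}}
        for index, chunk in enumerate(splitter(text))
    ]


sample = "Plain text can be loaded whole or in chunks."
print(len(build_documents(sample)))                 # direct mode
print(len(build_documents(sample, split_by_size)))  # split mode
```

Passing the splitter as a callable keeps the loader independent of any particular splitting strategy.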
Document Structure
Each document contains:
pageContent: Original or split text content
metadata:
Custom metadata from input
Split-specific metadata (when using splitter)
Additional metadata properties
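One way the metadata of a document could be assembled from the Additional Metadata and Omit Metadata Keys inputs is sketched below. The function build_metadata is hypothetical, and the order of operations (merge first, then omit) is an assumption; the module may apply these steps differently.

```python
import json
from typing import Any


def build_metadata(
    base: dict[str, Any],
    additional_metadata: str = "",
    omit_keys: str = "",
) -> dict[str, Any]:
    """Hypothetical helper: merge extra metadata, then drop omitted keys."""
    merged = dict(base)
    if additional_metadata.strip():
        # Additional Metadata is a JSON object supplied as a string.
        merged.update(json.loads(additional_metadata))
    # Omit Metadata Keys is a comma-separated list; ignore stray whitespace.
    omitted = {key.strip() for key in omit_keys.split(",") if key.strip()}
    return {key: value for key, value in merged.items() if key not in omitted}


metadata = build_metadata(
    {"chunk_index": 0, "loc": "lines 1-4"},
    additional_metadata='{"source": "manual-entry"}',
    omit_keys="loc, unused",
)
print(metadata)
```

Keys listed for omission that do not exist in the metadata are simply ignored in this sketch.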
Content Handling
Text Input
Direct string input
Multi-line support
Unicode support
Escape character handling
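Escape character handling is not detailed here. One plausible reading, shown below purely as an assumption, is that escape sequences typed literally into the input (a backslash followed by n, t, or r) are converted into the characters they represent. The function unescape_basic is hypothetical.

```python
def unescape_basic(text: str) -> str:
    """Hypothetical helper: turn literal \\n, \\t and \\r sequences into real characters."""
    replacements = {"\\n": "\n", "\\t": "\t", "\\r": "\r"}
    for escaped, actual in replacements.items():
        text = text.replace(escaped, actual)
    return text


raw = "first line\\nsecond line"  # contains a literal backslash followed by 'n'
print(unescape_basic(raw))
```

This sketch does not treat an escaped backslash specially, so input that needs a literal backslash before one of these letters would require a more careful parser.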
Processing Options
Text splitting
Metadata addition
Character normalization
Whitespace handling
Notes
Simple and efficient
No file handling required
Error handling for invalid inputs
Support for large texts
Metadata customization
Character encoding support