Key Features
OCR Technology
Advanced optical character recognition extracts text from images and scanned documents
Multi-Format Support
Process PDFs, images (JPG, PNG, TIFF), and text-based documents
Raw Text Extraction
Extracts plain text content without structure analysis
Automation Ready
Seamlessly integrates with automation workflows for document processing
Supported File Types
The Text File Reader can process:Document Formats
Document Formats
- PDF - Text-based and scanned PDFs
- DOC/DOCX - Microsoft Word documents
- TXT - Plain text files
- RTF - Rich text format documents
Image Formats
Image Formats
- JPG/JPEG - Compressed image files
- PNG - Portable network graphics
- TIFF - Tagged image file format
- BMP - Bitmap image files
Scanned Documents
Scanned Documents
- Business letters and memos
- Contracts and agreements
- Forms and applications
- Reports and documentation
Creating a Text File Reader
1
Navigate to File Readers
In your application, go to File Readers section
2
Create New Reader
Click + File Reader and select Text from the document type options
3
Configure Basic Settings
Name: Enter a descriptive name (e.g., “Contract Text Extraction”)Description: Optional description for team reference
4
Set Up Text Fields
Configure the text fields you want to extract:Field Name: Descriptive name for the extracted text
Field Type: Select “Text” for raw text extraction
Required: Mark if this field must contain content
5
Test with Sample Document
Upload a sample document to validate text extraction accuracy
Configuration Options
Field Configuration
The Text File Reader extracts content into configurable text fields:- Single Text Field
- Multiple Text Fields
Use Case: Extract all text into one fieldConfiguration:
- Field Name: “extracted_text”
- Field Type: Text
- Description: “Complete document text content”
OCR Quality Settings
OCR accuracy depends on document quality. For best results, use high-resolution images and clear text formatting.
- Image resolution (higher is better)
- Text clarity and contrast
- Font size and style
- Document orientation
- Background interference
Using in Automations
Integration with Automation Workflows
The Text File Reader integrates seamlessly with automation workflows:Common Automation Patterns
Document Processing
Document Processing
Trigger: Email Received (with PDF attachment)
File Reader: Extract text from PDF
Actions:
- AI Classification to categorize content
- Create Record with extracted text
- Send Email Notification to confirm processing
Content Analysis
Content Analysis
Trigger: Attachment Added
File Reader: Extract text from image
Actions:
- AI Summarization to create brief summary
- Post Comment with extracted content
- Update Record Fields with text data
Text Search and Indexing
Text Search and Indexing
Trigger: Record Created
File Reader: Process uploaded document
Actions:
- Transform Data to clean text
- Update Record Fields for searchability
- Generate Report with text content
File Reader Action Configuration
When adding a File Reader action to your automation:- Select File Reader Type: Choose your configured Text File Reader
- Configure Input Source:
- File from Email Received trigger
- Attachment from Record
- Upload from user interaction
- Map Output Fields: Use extracted text in subsequent actions
- Handle Errors: Add IF conditions for missing or invalid content
Best Practices
Document Quality
Use high-resolution, clearly formatted documents for best OCR accuracy
Field Validation
Implement validation checks in automations for critical text extraction
Error Handling
Use IF conditions to handle cases where text extraction fails
Performance Optimization
Consider file size limits and processing time for large documents
Document Preparation Tips
For Best OCR Results:- Use documents with clear, dark text on light backgrounds
- Ensure proper orientation (not rotated or tilted)
- Avoid documents with heavy background patterns
- Use standard fonts when possible
- Maintain good contrast between text and background
- Optimize image resolution (300 DPI is typically sufficient)
- Balance quality with processing time
- Consider splitting large documents into smaller sections
Advanced Features
Text Post-Processing
After OCR extraction, you can enhance the text using automation actions:1
Transform Data with AI
Clean and normalize extracted text, fix common OCR errors
2
AI Classification
Categorize extracted content based on text analysis
3
AI Summarization
Create concise summaries of lengthy extracted text
Batch Processing
Process multiple documents efficiently:- Set up trigger for multiple file uploads
- Use Repeat For Each to iterate through files
- Apply Text File Reader to each document
- Process extracted text with subsequent actions
Error Handling and Troubleshooting
Common Issues
Poor Text Recognition
Poor Text Recognition
Symptoms: Inaccurate or garbled text extractionCauses:
- Low image resolution
- Poor document quality
- Unusual fonts or formatting
- Use higher resolution source documents
- Improve document clarity before processing
- Consider using Elementum Intelligence File Reader for better accuracy
Missing Text Content
Missing Text Content
Symptoms: Text fields return empty valuesCauses:
- Document contains only images/graphics
- Text is embedded in unsupported format
- OCR failed to detect text regions
- Verify document contains readable text
- Use supported file formats
- Test with different document samples
Processing Timeouts
Processing Timeouts
Symptoms: File Reader action fails with timeout errorCauses:
- Document is too large
- High complexity image processing
- System resource limitations
- Reduce document size or resolution
- Split large documents into smaller sections
- Process during off-peak hours
Validation and Testing
Always test your Text File Reader with representative sample documents before deploying in production automations.
- Upload sample documents similar to production files
- Review extracted text for accuracy
- Test with different document qualities and formats
- Validate field mapping and data types
- Test error handling with problematic documents
Comparison with Other File Readers
When to Use Text File Reader
Choose Text File Reader when:- You need simple text extraction without structure
- Processing scanned documents or images
- Working with various document formats
- OCR accuracy is sufficient for your use case
- You need structured data extraction (Purchase Orders Reader)
- Working with spreadsheets (Table File Reader)
- Requiring AI-powered analysis (Elementum Intelligence Reader)
- Processing JSON data (JSON File Reader)
Performance Comparison
Feature | Text File Reader | Elementum Intelligence | Purchase Orders |
---|---|---|---|
Speed | ⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐ |
Accuracy | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
Flexibility | ⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐ |
Cost | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐ |
Integration Examples
Email Processing Workflow
- Trigger: Email Received
- File Reader: Extract text from email attachment
- AI Classification: Categorize issue type
- Create Record: Generate support ticket with extracted text
- Send Email: Confirm receipt and ticket number
Document Archival System
- Trigger: Attachment Added
- File Reader: Extract text content
- Transform Data: Clean and standardize text
- Update Record: Store extracted content for search
- Generate Report: Create searchable document index
Next Steps
Automation System
Learn how to integrate Text File Readers with automation workflows
AI Services
Enhance text processing with AI classification and summarization
Purchase Orders Reader
Process structured business documents with specialized OCR
Elementum Intelligence Reader
Upgrade to AI-powered document analysis for complex extraction
The Text File Reader provides reliable OCR-based text extraction for a wide variety of document types. Use it for simple text extraction needs or as a foundation for more complex document processing workflows.