Text File Reader - Elementum Documentation

The Text File Reader uses Optical Character Recognition (OCR) to extract raw text content from images, text-based documents, and PDFs. This reader is ideal for simple text extraction when you need to convert visual content into editable text for use in automation workflows.

Key Features

OCR Technology

Advanced optical character recognition extracts text from images and scanned documents

Multi-Format Support

Process PDFs, images (JPG, PNG, TIFF), and text-based documents

Raw Text Extraction

Extracts plain text content without structure analysis

Automation Ready

Seamlessly integrates with automation workflows for document processing

Supported File Types

The Text File Reader can process:

Document Formats

PDF - Text-based and scanned PDFs
DOC/DOCX - Microsoft Word documents
TXT - Plain text files
RTF - Rich text format documents

Image Formats

JPG/JPEG - Compressed image files
PNG - Portable network graphics
TIFF - Tagged image file format
BMP - Bitmap image files

Scanned Documents

Business letters and memos
Contracts and agreements
Forms and applications
Reports and documentation
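
As an illustration of the format lists above, a file can be checked against the supported extensions before it reaches the reader. This is a minimal sketch, not part of the product: the function name `classify_upload` and the two set names are hypothetical, and the extensions are copied from the lists above.

```python
from pathlib import Path

# Extensions copied from the format lists above; the grouping names are illustrative.
DOCUMENT_EXTENSIONS = {".pdf", ".doc", ".docx", ".txt", ".rtf"}
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".tiff", ".bmp"}


def classify_upload(filename: str) -> str:
    """Return 'document', 'image', or 'unsupported' based on the file extension."""
    suffix = Path(filename).suffix.lower()  # case-insensitive: ".PDF" matches ".pdf"
    if suffix in DOCUMENT_EXTENSIONS:
        return "document"
    if suffix in IMAGE_EXTENSIONS:
        return "image"
    return "unsupported"
```

A check like this only looks at the file name. It says nothing about whether the file content is readable, so sample testing is still needed.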

Creating a Text File Reader

Navigate to File Readers

In your application, go to the File Readers section

Create New Reader

Click + File Reader and select Text from the document type options

Configure Basic Settings

Name: Enter a descriptive name (e.g., “Contract Text Extraction”)
Description: Optional description for team reference

Set Up Text Fields

Configure the text fields you want to extract:

Field Name: Descriptive name for the extracted text
Field Type: Select “Text” for raw text extraction
Required: Mark if this field must contain content

Test with Sample Document

Upload a sample document to validate text extraction accuracy

Configuration Options

Field Configuration

The Text File Reader extracts content into configurable text fields:

Single Text Field
Multiple Text Fields

Use Case: Extract all text into one field

Configuration:

Field Name: “extracted_text”
Field Type: Text
Description: “Complete document text content”

Best for: Simple text extraction, content analysis

OCR Quality Settings

OCR accuracy depends on document quality. For best results, use high-resolution images and clear text formatting.

Factors affecting OCR accuracy:

Image resolution (higher is better)
Text clarity and contrast
Font size and style
Document orientation
Background interference

Using in Automations

Integration with Automation Workflows

In an automation, the Text File Reader typically sits between a trigger and the actions that use the extracted text:

Email Received → Text File Reader → AI Classification → Create Record → Send Notification

Common Automation Patterns

Document Processing

Trigger: Email Received (with PDF attachment)
File Reader: Extract text from PDF
Actions:

AI Classification to categorize content
Create Record with extracted text
Send Email Notification to confirm processing

Content Analysis

Trigger: Attachment Added
File Reader: Extract text from image
Actions:

AI Summarization to create brief summary
Post Comment with extracted content
Update Record Fields with text data

Text Search and Indexing

Trigger: Record Created
File Reader: Process uploaded document
Actions:

Transform Data to clean text
Update Record Fields for searchability
Generate Report with text content

File Reader Action Configuration

When adding a File Reader action to your automation:

Select File Reader Type: Choose your configured Text File Reader
Configure Input Source:
- File from Email Received trigger
- Attachment from Record
- Upload from user interaction
Map Output Fields: Use extracted text in subsequent actions
Handle Errors: Add IF conditions for missing or invalid content
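
The error-handling step above amounts to a check on the reader's output before later actions use it. The sketch below shows that logic in Python for illustration only; in the platform this would be expressed with IF conditions. The function name, the status strings, and the `min_chars` threshold are all assumptions.

```python
from typing import Optional


def extraction_status(text: Optional[str], required: bool = True, min_chars: int = 1) -> str:
    """Classify a reader output as 'ok', 'empty', or 'too_short'."""
    cleaned = (text or "").strip()  # treat None and whitespace-only output as empty
    if not cleaned:
        # An empty optional field is acceptable; an empty required field is not.
        return "empty" if required else "ok"
    if len(cleaned) < min_chars:
        return "too_short"
    return "ok"
```

Branching on a status such as this lets an automation skip record creation or notify a reviewer instead of storing blank text.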

Best Practices

Document Quality

Use high-resolution, clearly formatted documents for best OCR accuracy

Field Validation

Implement validation checks in automations for critical text extraction

Error Handling

Use IF conditions to handle cases where text extraction fails

Performance Optimization

Consider file size limits and processing time for large documents

Document Preparation Tips

For Best OCR Results:

Use documents with clear, dark text on light backgrounds
Ensure proper orientation (not rotated or tilted)
Avoid documents with heavy background patterns
Use standard fonts when possible
Maintain good contrast between text and background

File Size Considerations:

Optimize image resolution (300 DPI is typically sufficient)
Balance quality with processing time
Consider splitting large documents into smaller sections
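
To make the 300 DPI guideline concrete, the arithmetic below converts between physical page size and pixel dimensions. It is a small worked example with hypothetical function names, not a statement about how the reader measures resolution.

```python
def pixels_for_page(width_in: float, height_in: float, dpi: int = 300) -> tuple[int, int]:
    """Pixel dimensions of a page scanned at the given resolution."""
    return round(width_in * dpi), round(height_in * dpi)


def effective_dpi(pixel_width: int, page_width_in: float) -> float:
    """Resolution implied by an image's pixel width and the physical page width."""
    return pixel_width / page_width_in


# US Letter (8.5 x 11 in) scanned at 300 DPI
print(pixels_for_page(8.5, 11))   # (2550, 3300)

# A 1275-pixel-wide scan of the same page is only half that resolution
print(effective_dpi(1275, 8.5))   # 150.0
```

Pixel count grows with the square of the resolution, so doubling DPI roughly quadruples the amount of image data to process.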

Advanced Features

Text Post-Processing

After OCR extraction, you can enhance the text using automation actions:

Transform Data with AI

Clean and normalize extracted text, fix common OCR errors

AI Classification

Categorize extracted content based on text analysis

AI Summarization

Create concise summaries of lengthy extracted text
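
For a sense of what cleaning and normalizing extracted text can involve, here is a minimal rule-based sketch. It does not describe how the Transform Data with AI action works. The function name and the specific rules are assumptions chosen to cover common OCR artifacts.

```python
import re
import unicodedata


def normalize_ocr_text(raw: str) -> str:
    """Apply a few conservative cleanups to OCR output."""
    text = unicodedata.normalize("NFKC", raw)               # fold ligatures and non-breaking spaces
    text = text.replace("\r\n", "\n").replace("\r", "\n")   # unify line endings
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)            # rejoin words hyphenated across lines
    text = re.sub(r"[ \t]+", " ", text)                     # collapse runs of spaces and tabs
    text = re.sub(r" ?\n ?", "\n", text)                    # trim spaces around line breaks
    text = re.sub(r"\n{3,}", "\n\n", text)                  # keep at most one blank line
    return text.strip()
```

The hyphen rule also merges genuine compounds that happen to break at a line end (for example "well-" followed by "known"), which is one reason an AI-based cleanup step may do better on real documents.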

Batch Processing

Process multiple documents efficiently:

File Upload → Repeat For Each → Text File Reader → Transform Data → Update Records

Configuration:

Set up trigger for multiple file uploads
Use Repeat For Each to iterate through files
Apply Text File Reader to each document
Process extracted text with subsequent actions
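
The loop behind the Repeat For Each step can be sketched as follows. The `extract_text` argument is a hypothetical stand-in for the Text File Reader, passed in as a function so the example stays self-contained. The point of the sketch is that one failing file is recorded and skipped rather than stopping the batch.

```python
from typing import Callable, Dict, Iterable, List, Tuple


def process_batch(
    filenames: Iterable[str],
    extract_text: Callable[[str], str],
) -> Tuple[Dict[str, str], List[Tuple[str, str]]]:
    """Run the extractor on each file, collecting results and failures separately."""
    results: Dict[str, str] = {}
    failures: List[Tuple[str, str]] = []
    for name in filenames:
        try:
            text = extract_text(name).strip()
        except Exception as exc:  # one bad file should not stop the batch
            failures.append((name, str(exc)))
            continue
        if text:
            results[name] = text
        else:
            failures.append((name, "no text extracted"))
    return results, failures
```

Keeping failures in a separate list makes it straightforward to route them to a follow-up action, such as a notification or a manual review queue.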

Error Handling and Troubleshooting

Common Issues

Poor Text Recognition

Symptoms: Inaccurate or garbled text extraction

Causes:

Low image resolution
Poor document quality
Unusual fonts or formatting

Solutions:

Use higher resolution source documents
Improve document clarity before processing
Consider using the Elementum Intelligence Reader for better accuracy

Missing Text Content

Symptoms: Text fields return empty values

Causes:

Document contains only images/graphics
Text is embedded in unsupported format
OCR failed to detect text regions

Solutions:

Verify document contains readable text
Use supported file formats
Test with different document samples

Processing Timeouts

Symptoms: File Reader action fails with timeout error

Causes:

Document is too large
Highly complex image processing
System resource limitations

Solutions:

Reduce document size or resolution
Split large documents into smaller sections
Process during off-peak hours
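
Splitting a large document, as suggested above, comes down to choosing page ranges of a bounded size. The helper below is a sketch with a hypothetical name. It only computes the ranges and leaves the actual splitting to whatever PDF or image tooling is available.

```python
def page_ranges(total_pages: int, max_pages: int) -> list[tuple[int, int]]:
    """Split pages 1..total_pages into inclusive (first, last) ranges of at most max_pages."""
    if max_pages < 1:
        raise ValueError("max_pages must be at least 1")
    return [
        (start, min(start + max_pages - 1, total_pages))
        for start in range(1, total_pages + 1, max_pages)
    ]


print(page_ranges(23, 10))  # [(1, 10), (11, 20), (21, 23)]
```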

Validation and Testing

Always test your Text File Reader with representative sample documents before deploying in production automations.

Testing Process:

Upload sample documents similar to production files
Review extracted text for accuracy
Test with different document qualities and formats
Validate field mapping and data types
Test error handling with problematic documents
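
One way to make the accuracy review in the testing process repeatable is to compare extracted text against a hand-checked reference and compute a similarity score. The sketch below uses Python's standard `difflib` module. The function names and the 0.95 threshold are assumptions, not product settings.

```python
from difflib import SequenceMatcher


def similarity(expected: str, extracted: str) -> float:
    """Return a 0..1 similarity score, ignoring case and whitespace differences."""
    def canon(s: str) -> str:
        return " ".join(s.lower().split())
    return SequenceMatcher(None, canon(expected), canon(extracted)).ratio()


def passes(expected: str, extracted: str, threshold: float = 0.95) -> bool:
    """True when the extracted text is close enough to the reference."""
    return similarity(expected, extracted) >= threshold
```

A single misread character in a short string, such as "l" for "1" in an invoice number, lowers the score noticeably, so thresholds are best tuned on representative samples.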

Comparison with Other File Readers

When to Use Text File Reader

Choose Text File Reader when:

You need simple text extraction without structure
You are processing scanned documents or images
You work with a variety of document formats
OCR accuracy is sufficient for your use case

Consider alternatives when:

You need structured data extraction (Purchase Orders Reader)
You are working with spreadsheets (Table File Reader)
You require AI-powered analysis (Elementum Intelligence Reader)
You are processing JSON data (JSON File Reader)

Performance Comparison

Feature	Text File Reader	Elementum Intelligence	Purchase Orders
Speed	⭐⭐	⭐⭐⭐	⭐⭐⭐⭐
Accuracy	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐
Flexibility	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐
Cost	⭐⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐

Integration Examples

Email Processing Workflow

Email Received (with attachment) → Text File Reader → AI Classification → Create Support Ticket

Configuration:

Trigger: Email Received
File Reader: Extract text from email attachment
AI Classification: Categorize issue type
Create Record: Generate support ticket with extracted text
Send Email: Confirm receipt and ticket number

Document Archival System

Document Upload → Text File Reader → Transform Data → Update Record → Generate Report

Configuration:

Trigger: Attachment Added
File Reader: Extract text content
Transform Data: Clean and standardize text
Update Record: Store extracted content for search
Generate Report: Create searchable document index
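
To illustrate what a searchable document index can look like, the sketch below builds a simple inverted index from extracted text. It is a conceptual example only: the function names and record IDs are hypothetical, and it does not describe how the platform stores or searches record fields.

```python
import re
from collections import defaultdict
from typing import Dict, Set


def build_index(documents: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each lowercase word to the set of record IDs whose text contains it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for record_id, text in documents.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(record_id)
    return dict(index)


def search(index: Dict[str, Set[str]], query: str) -> Set[str]:
    """Return the record IDs that contain every word in the query."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    hits = [index.get(word, set()) for word in words]
    return set.intersection(*hits)
```

The tokenizer here handles only ASCII letters and digits, so text in other scripts would need a different word pattern.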

Next Steps

Automation System

Learn how to integrate Text File Readers with automation workflows

AI Services

Enhance text processing with AI classification and summarization

Purchase Orders Reader

Process structured business documents with specialized OCR

Elementum Intelligence Reader

Upgrade to AI-powered document analysis for complex extraction

The Text File Reader provides reliable OCR-based text extraction for a wide variety of document types. Use it for simple text extraction needs or as a foundation for more complex document processing workflows.
