Get All Text

What it does: Extracts all visible text from a webpage, giving you the complete text content to analyze, summarize, or process with other tools.

Perfect for: Content analysis • Research collection • SEO audits • AI text processing

What Goes In

Name	Type	Description	Required	Default
Include Hidden Text	Boolean	Extract text from hidden elements too	No	`false`
Max Length	Number	Maximum characters to extract (0 = unlimited)	No	`100000`
Include Links	Boolean	Include URLs from links in the text	No	`false`

What Comes Out

Name	Type	Description
fullText	String	All the text content from the page
wordCount	Number	Total number of words found
characterCount	Number	Total number of characters
pageTitle	String	Title of the webpage
pageUrl	String	URL of the webpage
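
To make the output shape concrete, here is a minimal Python sketch of a record carrying the five fields above. The class name `GetAllTextOutput` and the helper `build_output` are hypothetical, and counting words by splitting on whitespace is an assumption; the node itself may count words and characters differently.

```python
from dataclasses import dataclass


@dataclass
class GetAllTextOutput:
    """Hypothetical container mirroring the output fields listed above."""
    fullText: str
    wordCount: int
    characterCount: int
    pageTitle: str
    pageUrl: str


def build_output(text: str, title: str, url: str) -> GetAllTextOutput:
    # Assumption: words are whitespace-separated tokens and characters
    # are counted with len(); the real node may apply other rules.
    return GetAllTextOutput(
        fullText=text,
        wordCount=len(text.split()),
        characterCount=len(text),
        pageTitle=title,
        pageUrl=url,
    )


example = build_output("Hello world from the page", "Demo", "https://example.com")
print(example.wordCount, example.characterCount)
```

Downstream nodes can then read `fullText` for the content itself and use the two counts as quick sanity checks on how much was extracted.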

Real-World Examples

Content Analysis: Extract all text from articles or blog posts to analyze writing style, sentiment, or key topics.

Research Data Collection: Gather text content from multiple research sources to build a comprehensive knowledge base.

SEO Content Review: Extract all text from webpages to analyze keyword density and content quality.
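
As an illustration of the SEO use case, the sketch below computes a simple keyword density from the extracted `fullText`. It is one possible post-processing step, not part of the node; the function name `keyword_density` and the tokenization rule (lowercase runs of letters, digits, and apostrophes) are assumptions.

```python
import re
from collections import Counter


def keyword_density(full_text: str, top_n: int = 5) -> list[tuple[str, float]]:
    """Return the top_n words with their share of all words, in percent."""
    # Assumption: a "word" is a lowercase run of letters, digits, or apostrophes.
    words = re.findall(r"[a-z0-9']+", full_text.lower())
    if not words:
        return []
    counts = Counter(words)
    total = len(words)
    return [
        (word, round(100 * n / total, 2))
        for word, n in counts.most_common(top_n)
    ]


sample = "Fast shoes for fast runners. Shoes that last."
print(keyword_density(sample, top_n=2))
```

A real audit would usually drop stop words such as "for" and "that" before counting; that step is left out here to keep the sketch short.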

How to Use It

Navigate to the webpage you want to extract text from
Configure the settings: set Max Length and the include options as needed
Run the workflow: the node automatically extracts all visible text
Process the text with other nodes, such as AI analysis or data storage

Simple Example:

{
  "includeHiddenText": false,
  "maxLength": 50000,
  "includeLinks": true
}

🔍 Technical Details

Text Extraction: The node reads the visible text elements on the page and, where it can, skips navigation menus, ads, and other non-content areas. This filtering is best-effort, so some navigation or footer text may still appear in the output.

Content Filtering: Hidden text is excluded by default. You can include it, limit the amount of text extracted, and choose whether to include link URLs.

Performance: Large pages may take longer to process. Setting a max length helps prevent memory issues.
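
The effect of Max Length can be pictured with a short sketch. This is an assumption about the behavior, not the node's actual code: it cuts at a plain character boundary and treats 0 as unlimited, as described in the settings table. The function name `apply_max_length` is hypothetical.

```python
def apply_max_length(text: str, max_length: int = 100000) -> str:
    """Truncate text to max_length characters; 0 means no limit."""
    if max_length < 0:
        raise ValueError("max_length must be 0 or a positive number")
    if max_length == 0:
        # 0 = unlimited: return the text untouched.
        return text
    # Assumption: a plain character cut; the node may cut elsewhere,
    # for example at a word boundary.
    return text[:max_length]


print(len(apply_max_length("x" * 200000)))        # capped at the default
print(len(apply_max_length("x" * 200000, 0)))     # unlimited
```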

Limitations:

Cannot extract text from images or videos
Dynamic content that loads after the initial page load may not be captured
Very large pages may hit browser memory limits

Try It Yourself

Article Analysis:

{
  "includeHiddenText": false,
  "maxLength": 0,
  "includeLinks": false
}

Research Collection:

{
  "includeHiddenText": false,
  "maxLength": 25000,
  "includeLinks": true
}

Content Monitoring:

{
  "includeHiddenText": false,
  "maxLength": 10000,
  "includeLinks": false
}

Common Issues:

No text extracted? The page might still be loading. Wait a moment, then run the workflow again.
Too much irrelevant text? The extraction can still pick up navigation and footer text. Use AI filtering to focus on the main content.
Missing some content? Dynamic content that loads after the initial page load may not be captured.
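
For the irrelevant-text case, a rough heuristic can serve as a first pass before AI filtering. The sketch below drops very short lines, which are often menu items or footer fragments. It assumes `fullText` keeps line breaks between page elements, which the node may or may not do; the function name `drop_short_lines` and the five-word threshold are illustrative choices.

```python
def drop_short_lines(full_text: str, min_words: int = 5) -> str:
    """Keep only lines that contain at least min_words words."""
    kept = [
        line.strip()
        for line in full_text.splitlines()
        if len(line.split()) >= min_words
    ]
    return "\n".join(kept)


page_text = (
    "Home\n"
    "About\n"
    "Contact\n"
    "This article explains how text extraction works in practice.\n"
    "© 2024 Example"
)
print(drop_short_lines(page_text))
```

This heuristic also removes short headings and captions, so it suits body-text analysis better than tasks that need the full page structure.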

What’s Next?

Related nodes: Get Selected Text • Get All HTML • Content Replacer

Common workflows: Smart Text Extraction • Content Analysis • Research Automation

Build a research database: Combine with Edit Fields to clean and organize extracted content