Get All Text

What it does: Extracts all visible text from a webpage, giving you the complete text content to analyze, summarize, or process with other tools.

Perfect for: Content analysis • Research collection • SEO audits • AI text processing

Settings:

| Name | Type | Description | Required | Default |
|---|---|---|---|---|
| Include Hidden Text | Boolean | Extract text from hidden elements too | No | false |
| Max Length | Number | Maximum characters to extract (0 = unlimited) | No | 100000 |
| Include Links | Boolean | Include URLs from links in the text | No | false |

Output:

| Name | Type | Description |
|---|---|---|
| fullText | String | All the text content from the page |
| wordCount | Number | Total number of words found |
| characterCount | Number | Total number of characters |
| pageTitle | String | Title of the webpage |
| pageUrl | String | URL of the webpage |
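To make the relationship between the output fields concrete, here is a minimal Python sketch that assembles a record with the same field names. The `build_output` helper is hypothetical, and counting words as whitespace-separated tokens is an assumption; the node may count words differently.

```python
def build_output(full_text: str, page_title: str, page_url: str) -> dict:
    """Assemble a record shaped like the node's output fields (illustrative only)."""
    return {
        "fullText": full_text,
        # Assumption: a "word" is any whitespace-separated token.
        "wordCount": len(full_text.split()),
        "characterCount": len(full_text),
        "pageTitle": page_title,
        "pageUrl": page_url,
    }


record = build_output("Hello world, this is a page.", "Example", "https://example.com")
print(record["wordCount"], record["characterCount"])
```

Downstream nodes can then read `wordCount` or `characterCount` without re-scanning `fullText`.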

Content Analysis: Extract all text from articles or blog posts to analyze writing style, sentiment, or key topics.

Research Data Collection: Gather text content from multiple research sources to build a comprehensive knowledge base.

SEO Content Review: Extract all text from webpages to analyze keyword density and content quality.
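As an illustration of the SEO use case, keyword density can be computed from the extracted `fullText` in a later step. The sketch below is one simple way to do it in Python; the `keyword_density` function and its tokenization rule are assumptions, not part of the node.

```python
import re
from collections import Counter


def keyword_density(full_text: str, top_n: int = 5) -> list[tuple[str, float]]:
    """Return the top_n most frequent words with their share of all words."""
    # Assumption: words are runs of letters, digits, or apostrophes, case-insensitive.
    words = re.findall(r"[a-z0-9']+", full_text.lower())
    if not words:
        return []
    counts = Counter(words)
    total = len(words)
    return [(word, count / total) for word, count in counts.most_common(top_n)]


sample = "Fresh coffee beans. Coffee roasted daily. Buy coffee online."
for word, share in keyword_density(sample, top_n=3):
    print(f"{word}: {share:.1%}")
```

A real audit would likely also drop stop words such as "the" and "and" before counting.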

  1. Navigate to the webpage you want to extract text from
  2. Configure the settings - set limits or include options as needed
  3. Run the workflow - the node automatically extracts all visible text
  4. Process the text with other nodes like AI analysis or data storage

Simple Example:

{
  "includeHiddenText": false,
  "maxLength": 50000,
  "includeLinks": true
}
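
If you generate settings like these programmatically, it can help to check them before running the workflow. The following Python sketch merges a partial configuration with the defaults listed in the settings table and rejects obviously invalid values. `load_settings` and `DEFAULTS` are hypothetical helpers, not part of the node.

```python
import json

# Defaults taken from the settings table on this page.
DEFAULTS = {"includeHiddenText": False, "maxLength": 100000, "includeLinks": False}


def load_settings(raw_json: str) -> dict:
    """Parse a JSON settings string and fill in any missing defaults."""
    user_settings = json.loads(raw_json)
    unknown = set(user_settings) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"Unknown settings: {sorted(unknown)}")
    settings = {**DEFAULTS, **user_settings}
    max_length = settings["maxLength"]
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(max_length, bool) or not isinstance(max_length, int) or max_length < 0:
        raise ValueError("maxLength must be a non-negative integer")
    return settings


print(load_settings('{"maxLength": 50000, "includeLinks": true}'))
```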

🔍 Technical Details

Text Extraction: The node reads the visible text elements on the page. It tries to exclude navigation menus, ads, and other non-content areas when possible, but some navigation and footer text can still end up in the output.
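
The node's internal extraction logic is not documented here, but the general idea can be sketched with Python's standard `html.parser` module. This example works on a static HTML string and skips a fixed set of tags; both the `SKIPPED_TAGS` list and the `extract_visible_text` helper are assumptions made for illustration, and the real node works on the live page in the browser.

```python
from html.parser import HTMLParser

# Assumption: these tags hold non-visible or non-content text.
SKIPPED_TAGS = {"script", "style", "noscript", "title", "nav", "footer", "aside"}


class VisibleTextParser(HTMLParser):
    """Collect text nodes that are not inside a skipped tag."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text found outside skipped regions.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_visible_text(html: str) -> str:
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = "<body><nav>Home</nav><p>Hello <b>world</b></p><footer>Copyright</footer></body>"
print(extract_visible_text(page))
```

A tag-based filter like this cannot detect elements hidden with CSS, which is one reason a browser-based extractor behaves differently from a static parser.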

Content Filtering: You can choose whether to include hidden text, cap the number of characters extracted, and choose whether to include link URLs.

Performance: Large pages may take longer to process. Setting a max length helps prevent memory issues.
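
The "0 = unlimited" rule for Max Length is easy to get wrong when re-implementing the limit in a later step, so here is a small Python sketch of those semantics. `apply_max_length` is a hypothetical helper; whether the node cuts at an exact character count or at a word boundary is not stated on this page, and this sketch cuts at the exact count.

```python
def apply_max_length(text: str, max_length: int) -> str:
    """Truncate text to max_length characters; 0 means no limit."""
    if max_length < 0:
        raise ValueError("max_length must be 0 (unlimited) or a positive number")
    if max_length == 0:
        return text
    return text[:max_length]


print(apply_max_length("abcdef", 3))
print(apply_max_length("abcdef", 0))
```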

Limitations:

  • Cannot extract text from images or videos
  • Some dynamic content may not be captured if it loads after the initial page load
  • Very large pages may hit browser memory limits

Article Analysis:

{
  "includeHiddenText": false,
  "maxLength": 0,
  "includeLinks": false
}

Research Collection:

{
  "includeHiddenText": false,
  "maxLength": 25000,
  "includeLinks": true
}

Content Monitoring:

{
  "includeHiddenText": false,
  "maxLength": 10000,
  "includeLinks": false
}
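
For content monitoring, a common follow-up step is to detect whether the extracted text changed since the last run. One possible approach, sketched in Python below, is to store a hash of the text and compare it on the next run. The `text_fingerprint` and `has_changed` helpers are hypothetical, and the whitespace normalization is an assumption meant to avoid false alarms from layout-only changes.

```python
import hashlib


def text_fingerprint(full_text: str) -> str:
    """Hash the text after collapsing whitespace, so layout noise is ignored."""
    normalized = " ".join(full_text.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def has_changed(previous_fingerprint: str, full_text: str) -> bool:
    """Compare a stored fingerprint with the fingerprint of newly extracted text."""
    return previous_fingerprint != text_fingerprint(full_text)


stored = text_fingerprint("Price: $10")
print(has_changed(stored, "Price:   $10\n"))
print(has_changed(stored, "Price: $12"))
```

Note that with a Max Length of 10000, changes beyond the first 10000 characters would not be detected by this comparison.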

Common Issues:

  • No text extracted? The page might still be loading - wait a moment and run the node again
  • Too much irrelevant text? The extraction can still include navigation and footer text - use AI filtering to focus on main content
  • Missing some content? Dynamic content that loads after the initial page load may not be captured
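
The "wait and try again" advice can be automated if you drive extraction from your own script. The Python sketch below retries a caller-supplied extraction function until it returns non-empty text. `extract_with_retry` and the `extract` callable are hypothetical; this page does not describe a scripting API for the node, so treat this as a pattern rather than a ready-made integration.

```python
import time
from typing import Callable


def extract_with_retry(
    extract: Callable[[], str],
    attempts: int = 3,
    delay_seconds: float = 1.0,
) -> str:
    """Call extract() until it returns non-empty text or attempts run out."""
    text = ""
    for attempt in range(attempts):
        text = extract()
        if text.strip():
            return text
        # Pause before the next try, but not after the final one.
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    return text


# Simulated page that only has text on the third read.
responses = iter(["", "", "Loaded content"])
print(extract_with_retry(lambda: next(responses), attempts=3, delay_seconds=0))
```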

Related nodes: Get Selected Text, Get All HTML, Content Replacer

Common workflows: Smart Text Extraction, Content Analysis, Research Automation

Build a research database: Combine with Edit Fields to clean and organize extracted content