
Get All Text

The Get All Text node captures the readable text from the page you’re currently viewing.

Use it when you want the page content as plain text (for example, to summarize it with AI or save it as notes).

  • You want to summarize an article.
  • You want to store page content in a local knowledge base.
  • You want to analyze or classify a page (topics, sentiment, etc.).
  • You want a “copy everything” step without manually selecting text.

The node scans the page and returns the text it can read. It also returns basic stats such as word and character counts.
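
To make the stats concrete, here is a minimal sketch of how word and character counts could be derived from extracted text. The function name `text_stats` and the output keys are hypothetical, and counting words by splitting on whitespace is an assumption; the node's actual counting rules are not documented here.

```python
def text_stats(text: str) -> dict:
    """Return simple stats for a block of extracted text (illustrative only)."""
    # Assumption: a "word" is any run of non-whitespace characters.
    words = text.split()
    return {
        "word_count": len(words),
        "character_count": len(text),  # counts every character, spaces included
    }


sample = "Get All Text captures readable page text."
stats = text_stats(sample)
print(stats)
```

A later step, such as a Filter node, could use numbers like these to skip pages that are too short to be worth summarizing.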

Diagram: the Web Page feeds the Text Extractor, which outputs Clean Text and a Word Count.

  1. Open the page you want to extract text from.
  2. Add this node to your workflow.
  3. Run the workflow.
  4. Use the output in the next node (for example: Filter, Edit Fields, or an AI node).

Let’s extract all text from a news article to analyze its content and key topics.

What you configure:

  • Include Hidden Text: Decide if you want text that isn’t visible on screen.
  • Max Length: Limit text size if you only need a portion.
  • Clean Formatting: Remove extra spaces and special characters for cleaner output.

What you get:

  • Full Text: The readable content of the page.
  • Stats: Total word count and character count.
  • Page Info: Title and URL of the source page.

| Setting | Purpose | When to Use |
| --- | --- | --- |
| Include Hidden Text | Extract text from hidden elements | When you need complete content including metadata |
| Max Length | Limit characters extracted | For large pages or memory management |
| Include Links | Add URLs from links in the text | When link context is important for analysis |

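
As a rough illustration of what the Clean Formatting and Max Length settings do to the output, the sketch below applies both to a raw string. The function `apply_text_settings` and its parameter names are hypothetical, not the node's real interface. It only collapses extra whitespace and does not attempt the special-character removal that Clean Formatting also performs, and it assumes Max Length is applied after cleaning.

```python
import re
from typing import Optional


def apply_text_settings(
    text: str,
    max_length: Optional[int] = None,
    clean_formatting: bool = False,
) -> str:
    """Apply whitespace cleanup and a character limit (illustrative only)."""
    if clean_formatting:
        # Collapse runs of spaces, tabs, and newlines into single spaces,
        # then trim leading and trailing whitespace.
        text = re.sub(r"\s+", " ", text).strip()
    if max_length is not None:
        # Assumption: the limit is measured in characters and truncates the end.
        text = text[:max_length]
    return text


raw = "  Breaking   news:\n\n  markets rally  "
cleaned = apply_text_settings(raw, clean_formatting=True)
shortened = apply_text_settings(raw, max_length=8, clean_formatting=True)
print(repr(cleaned))
print(repr(shortened))
```

Include Hidden Text and Include Links depend on the page structure itself, so they are not covered by this string-only sketch.
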
  • No text extracted: the page may still be loading, or the content appears after a delay. Try running again after the page finishes loading.
  • Too much irrelevant text: some pages include menus and footers. You can filter or summarize the text in a later step.
  • Missing content: if the page loads content dynamically, try scrolling the page first so the content appears before you run the workflow.
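
For the "too much irrelevant text" case, one simple way to filter in a later step is to drop very short lines, since menu and footer entries are usually only a word or two long. This is a heuristic sketch, not something the node does for you; `drop_short_lines` and the `min_words` threshold are hypothetical and would need tuning for real pages.

```python
def drop_short_lines(text: str, min_words: int = 5) -> str:
    """Keep only lines with at least `min_words` words (illustrative heuristic)."""
    kept = [
        line.strip()
        for line in text.splitlines()
        if len(line.split()) >= min_words
    ]
    return "\n".join(kept)


page_text = (
    "Home\n"
    "About\n"
    "Contact\n"
    "The city council approved the new transit budget on Monday.\n"
    "Privacy Policy"
)
article = drop_short_lines(page_text)
print(article)
```

Short but meaningful lines, such as headlines, would also be dropped by this rule, so lowering `min_words` or summarizing with an AI node may work better for some pages.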