Confluence Pages
Alhena AI supports data ingestion from Confluence pages.
This process begins when you provide a Confluence URL to Alhena AI. A dialog box will then appear, prompting you to connect with the Confluence app.
This connection involves an OAuth flow that securely links the Alhena AI app to access the specific Confluence pages. Once connected, Alhena AI can fetch data from Confluence pages.

Scraping an Entire Space
To crawl all pages in a Confluence space:
URL to add: The space overview URL, e.g.:
https://your-domain.atlassian.net/wiki/spaces/SPACEKEY/overviewhttps://your-domain.atlassian.net/wiki/spaces/SPACEKEY(also works)
Mode: Select "Multiple Pages"
What happens:
Alhena automatically discovers all pages in the space
Each page is fetched and added to your knowledge base
Blacklisting: If certain pages should be excluded, you can add their full URLs to the blacklist to skip them during crawling.
Scraping a Single Page
To crawl one specific Confluence page:
URL to add: The page URL, e.g.:
https://your-domain.atlassian.net/wiki/spaces/SPACEKEY/pages/12345/Page+Title
Mode: Either "Single Page" or "Multiple Pages" — doesn't matter. A page URL always scrapes just that one page (no sub-page discovery).
What happens:
Alhena fetches the page content and adds it to your knowledge base
Key Notes
No sub-page discovery: If you add a page URL, only that page is scraped — child/sub-pages are not automatically discovered. To get all pages, use the space URL with Multiple Pages mode.
Re-crawling: Subsequent crawls of the same space will skip pages that have already been imported.
Overview pages: Space overview URLs (
.../overview) are supported — the system resolves them to the space homepage.
Last updated