docs: clarify tool-access boundary in prompt injection section

Merge pull request #2322 from arc53/dependabot/npm_and_yarn/extensions/react-widget/svgo-4.0.1
chore(deps-dev): bump svgo from 3.3.3 to 4.0.1 in /extensions/react-widget
2026-05-07 06:30:03 +00:00 · 2026-04-15 18:31:42 +01:00 · 2026-04-15 17:06:40 +05:30 · 2026-04-15 10:43:02 +00:00 · 2026-04-15 16:03:56 +05:30 · 2026-04-15 10:28:46 +00:00
6 changed files with 1103 additions and 1398 deletions
--- a/.github/THREAT_MODEL.md
+++ b/.github/THREAT_MODEL.md
@@ -0,0 +1,144 @@
+# DocsGPT Public Threat Model
+
+**Classification:** Public  
+**Last updated:** 2026-04-15  
+**Applies to:** Open-source and self-hosted DocsGPT deployments
+
+## 1) Overview
+
+DocsGPT ingests content (files/URLs/connectors), indexes it, and answers queries via LLM-backed APIs and optional tools.
+
+Core components:
+- Backend API (`application/`)
+- Workers/ingestion (`application/worker.py` and related modules)
+- Datastores (MongoDB/Redis/vector stores)
+- Frontend (`frontend/`)
+- Optional extensions/integrations (`extensions/`)
+
+## 2) Scope and assumptions
+
+In scope:
+- Application-level threats in this repository.
+- Local and internet-exposed self-hosted deployments.
+
+Assumptions:
+- Internet-facing instances enable auth and use strong secrets.
+- Datastores/internal services are not publicly exposed.
+
+Out of scope:
+- Cloud hardware/provider compromise.
+- Security guarantees of external LLM vendors.
+- Full security audits of third-party systems targeted by tools (external DBs/MCP servers/code-exec APIs).
+
+## 3) Security objectives
+
+- Protect document/conversation confidentiality.
+- Preserve integrity of prompts, agents, tools, and indexed data.
+- Maintain API/worker availability.
+- Enforce tenant isolation in authenticated deployments.
+
+## 4) Assets
+
+- Documents, attachments, chunks/embeddings, summaries.
+- Conversations, agents, workflows, prompt templates.
+- Secrets (JWT secret, `INTERNAL_KEY`, provider/API/OAuth credentials).
+- Operational capacity (worker throughput, queue depth, model quota/cost).
+
+## 5) Trust boundaries and untrusted input
+
+Trust boundaries:
+- Internet ↔ Frontend
+- Frontend ↔ Backend API
+- Backend ↔ Workers/internal APIs
+- Backend/workers ↔ Datastores
+- Backend ↔ External LLM/connectors/remote URLs
+
+Untrusted input includes API payloads, file uploads, remote URLs, OAuth/webhook data, retrieved content, and LLM/tool arguments.
+
+## 6) Main attack surfaces
+
+1. Auth/authz paths and sharing tokens.
+2. File upload + parsing pipeline.
+3. Remote URL fetching and connectors (SSRF risk).
+4. Agent/tool execution from LLM output.
+5. Template/workflow rendering.
+6. Frontend rendering + token storage.
+7. Internal service endpoints (`INTERNAL_KEY`).
+8. High-impact integrations (SQL tool, generic API tool, remote MCP tools).
+
+## 7) Key threats and expected mitigations
+
+### A. Auth/authz misconfiguration
+- Threat: weak/no auth or leaked tokens leads to broad data access.
+- Mitigations: require auth for public deployments, short-lived tokens, rotation/revocation, least-privilege sharing.
+
+### B. Untrusted file ingestion
+- Threat: malicious files/archives trigger traversal, parser exploits, or resource exhaustion.
+- Mitigations: strict path checks, archive safeguards, file limits, patched parser dependencies.
+
+### C. SSRF/outbound abuse
+- Threat: URL loaders/tools access private/internal/metadata endpoints.
+- Mitigations: validate URLs + redirects, block private/link-local ranges, apply egress controls/allowlists.
+
+### D. Prompt injection + tool abuse
+- Threat: retrieved text manipulates model behavior and causes unsafe tool calls.
+- Threat: never rely on the model to "choose correctly" under adversarial input.
+- Mitigations: treat retrieved/model output as untrusted, enforce tool policies, only expose tools explicitly assigned by the user/admin to that agent, separate system instructions from retrieved content, audit tool calls.
+
+### E. Dangerous tool capability chaining (SQL/API/MCP)
+- Threat: write-capable SQL credentials allow destructive queries.
+- Threat: API tool can trigger side effects (infra/payment/webhook/code-exec endpoints).
+- Threat: remote MCP tools may expose privileged operations.
+- Mitigations: read-only-by-default credentials, destination allowlists, explicit approval for write/exec actions, per-tool policy enforcement + logging.
+
+### F. Frontend/XSS + token theft
+- Threat: XSS can steal local tokens and call APIs.
+- Mitigations: reduce unsafe rendering paths, strong CSP, scoped short-lived credentials.
+
+### G. Internal endpoint exposure
+- Threat: weak/unset `INTERNAL_KEY` enables internal API abuse.
+- Mitigations: fail closed, require strong random keys, keep internal APIs private.
+
+### H. DoS and cost abuse
+- Threat: request floods, large ingestion jobs, expensive prompts/crawls.
+- Mitigations: rate limits, quotas, timeouts, queue backpressure, usage budgets.
+
+## 8) Example attacker stories
+
+- Internet-exposed deployment runs with weak/no auth and receives unauthorized data access/abuse.
+- Intranet deployment intentionally using weak/no auth is vulnerable to insider misuse and lateral-movement abuse.
+- Crafted archive attempts path traversal during extraction.
+- Malicious URL/redirect chain targets internal services.
+- Poisoned document causes data exfiltration through tool calls.
+- Over-privileged SQL/API/MCP tool performs destructive side effects.
+
+## 9) Severity calibration
+
+- **Critical:** unauthenticated public data access; prompt-injection-driven exfiltration; SSRF to sensitive internal endpoints.
+- **High:** cross-tenant leakage, persistent token compromise, over-privileged destructive tools.
+- **Medium:** DoS/cost amplification and non-critical information disclosure.
+- **Low:** minor hardening gaps with limited impact.
+
+## 10) Baseline controls for public deployments
+
+1. Enforce authentication and secure defaults.
+2. Set/rotate strong secrets (`JWT`, `INTERNAL_KEY`, encryption keys).
+3. Restrict CORS and front API with a hardened proxy.
+4. Add rate limiting/quotas for answer/upload/crawl/token endpoints.
+5. Enforce URL+redirect SSRF protections and egress restrictions.
+6. Apply upload/archive/parsing hardening.
+7. Require least-privilege tool credentials and auditable tool execution.
+8. Monitor auth failures, tool anomalies, ingestion spikes, and cost anomalies.
+9. Keep dependencies/images patched and scanned.
+10. Validate multi-tenant isolation with explicit tests.
+
+## 11) Maintenance
+
+Review this model after major auth, ingestion, connector, tool, or workflow changes.
+
+## References
+
+- [OWASP Top 10 for LLM Applications](https://owasp.org/www-project-top-10-for-large-language-model-applications/)
+- [OWASP ASVS](https://owasp.org/www-project-application-security-verification-standard/)
+- [STRIDE overview](https://learn.microsoft.com/azure/security/develop/threat-modeling-tool-threats)
+- [DocsGPT SECURITY.md](../SECURITY.md)
--- a/application/requirements.txt
+++ b/application/requirements.txt
@@ -4,7 +4,7 @@ boto3==1.42.83
 beautifulsoup4==4.14.3
 cel-python==0.5.0
 celery==5.6.3
-cryptography==46.0.6
+cryptography==46.0.7
 dataclasses-json==0.6.7
 defusedxml==0.7.1
 docling>=2.16.0
@@ -29,14 +29,14 @@ jiter==0.13.0
 jmespath==1.1.0
 joblib==1.5.3
 jsonpatch==1.33
-jsonpointer==3.0.0
+jsonpointer==3.1.1
 kombu==5.6.2
 langchain==1.2.3
 langchain-community==0.4.1
-langchain-core==1.2.23
+langchain-core==1.2.29
 langchain-openai==1.1.12
 langchain-text-splitters==1.1.1
-langsmith==0.7.23
+langsmith==0.7.31
 lazy-object-proxy==1.12.0
 lxml==6.0.2
 markupsafe==3.0.3
@@ -84,7 +84,7 @@ tqdm==4.67.3
 transformers==5.4.0
 typing-extensions==4.15.0
 typing-inspect==0.9.0
-tzdata==2025.3
+tzdata==2026.1
 urllib3==2.6.3
 vine==5.1.0
 wcwidth==0.6.0
--- a/extensions/react-widget/package-lock.json
+++ b/extensions/react-widget/package-lock.json
--- a/extensions/react-widget/package.json
+++ b/extensions/react-widget/package.json
@@ -81,7 +81,7 @@
    "parcel": "^2.16.4",
    "prettier": "^3.8.1",
    "process": "^0.11.10",
-    "svgo": "^3.3.3",
+    "svgo": "^4.0.1",
    "typescript": "^5.3.3"
  },
  "publishConfig": {
--- a/frontend/package-lock.json
+++ b/frontend/package-lock.json
--- a/frontend/package.json
+++ b/frontend/package.json
@@ -33,12 +33,12 @@
    "i18next-browser-languagedetector": "^8.2.1",
    "lodash": "^4.17.21",
    "lucide-react": "^0.562.0",
-    "mermaid": "^11.12.1",
+    "mermaid": "^11.14.0",
    "prop-types": "^15.8.1",
    "radix-ui": "^1.4.3",
    "react": "^19.1.0",
    "react-chartjs-2": "^5.3.0",
-    "react-dom": "^19.1.1",
+    "react-dom": "^19.2.5",
    "react-dropzone": "^14.3.8",
    "react-google-drive-picker": "^1.2.2",
    "react-i18next": "^17.0.2",
@@ -53,10 +53,10 @@
    "tailwind-merge": "^3.4.0"
  },
  "devDependencies": {
-    "@tailwindcss/postcss": "^4.1.10",
+    "@tailwindcss/postcss": "^4.2.2",
    "@types/lodash": "^4.17.20",
    "@types/react": "^19.1.8",
-    "@types/react-dom": "^19.1.7",
+    "@types/react-dom": "^19.2.3",
    "@types/react-syntax-highlighter": "^15.5.13",
    "@typescript-eslint/eslint-plugin": "^8.58.2",
    "@typescript-eslint/parser": "^8.46.3",
@@ -64,7 +64,7 @@
    "eslint": "^9.39.1",
    "eslint-config-prettier": "^10.1.5",
    "eslint-plugin-import": "^2.31.0",
-    "eslint-plugin-n": "^17.23.1",
+    "eslint-plugin-n": "^17.24.0",
    "eslint-plugin-prettier": "^5.5.4",
    "eslint-plugin-promise": "^6.6.0",
    "eslint-plugin-react": "^7.37.5",
@@ -74,7 +74,7 @@
    "postcss": "^8.4.49",
    "prettier": "^3.5.3",
    "prettier-plugin-tailwindcss": "^0.7.2",
-    "tailwindcss": "^4.2.1",
+    "tailwindcss": "^4.2.2",
    "tw-animate-css": "^1.4.0",
    "typescript": "^5.8.3",
    "vite": "^8.0.0",
Author	SHA1	Message	Date
Alex	c18f85a050	docs: clarify tool-access boundary in prompt injection section	2026-04-15 18:31:42 +01:00
Manish Madan	5ecb174567	Merge pull request #2322 from arc53/dependabot/npm_and_yarn/extensions/react-widget/svgo-4.0.1 chore(deps-dev): bump svgo from 3.3.3 to 4.0.1 in /extensions/react-widget	2026-04-15 17:06:40 +05:30
dependabot[bot]	ed7212d016	chore(deps-dev): bump svgo in /extensions/react-widget Bumps [svgo](https://github.com/svg/svgo) from 3.3.3 to 4.0.1. - [Release notes](https://github.com/svg/svgo/releases) - [Commits](https://github.com/svg/svgo/compare/v3.3.3...v4.0.1) --- updated-dependencies: - dependency-name: svgo dependency-version: 4.0.1 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 10:43:02 +00:00
Manish Madan	f82acdab5d	Merge pull request #2384 from arc53/dependabot/npm_and_yarn/frontend/eslint-plugin-n-17.24.0 chore(deps-dev): bump eslint-plugin-n from 17.23.1 to 17.24.0 in /frontend	2026-04-15 16:03:56 +05:30
dependabot[bot]	361aebc34c	chore(deps-dev): bump eslint-plugin-n in /frontend Bumps [eslint-plugin-n](https://github.com/eslint-community/eslint-plugin-n) from 17.23.1 to 17.24.0. - [Release notes](https://github.com/eslint-community/eslint-plugin-n/releases) - [Changelog](https://github.com/eslint-community/eslint-plugin-n/blob/master/CHANGELOG.md) - [Commits](https://github.com/eslint-community/eslint-plugin-n/compare/v17.23.1...v17.24.0) --- updated-dependencies: - dependency-name: eslint-plugin-n dependency-version: 17.24.0 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 10:28:46 +00:00
Manish Madan	bf194c1a0f	Merge pull request #2311 from arc53/dependabot/npm_and_yarn/extensions/react-widget/babel/preset-env-7.29.2 chore(deps-dev): bump @babel/preset-env from 7.24.6 to 7.29.2 in /extensions/react-widget	2026-04-15 13:16:41 +05:30
Manish Madan	54c396750b	Merge pull request #2385 from arc53/dependabot/npm_and_yarn/frontend/multi-2a6546692b chore(deps): bump react-dom and @types/react-dom in /frontend	2026-04-15 13:12:49 +05:30
dependabot[bot]	9adebfec69	chore(deps): bump react-dom and @types/react-dom in /frontend Bumps [react-dom](https://github.com/facebook/react/tree/HEAD/packages/react-dom) and [@types/react-dom](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/react-dom). These dependencies needed to be updated together. Updates `react-dom` from 19.2.0 to 19.2.5 - [Release notes](https://github.com/facebook/react/releases) - [Changelog](https://github.com/facebook/react/blob/main/CHANGELOG.md) - [Commits](https://github.com/facebook/react/commits/v19.2.5/packages/react-dom) Updates `@types/react-dom` from 19.2.2 to 19.2.3 - [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases) - [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/react-dom) --- updated-dependencies: - dependency-name: react-dom dependency-version: 19.2.5 dependency-type: direct:production update-type: version-update:semver-patch - dependency-name: "@types/react-dom" dependency-version: 19.2.3 dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 07:41:54 +00:00
Manish Madan	92c321f163	Merge pull request #2386 from arc53/dependabot/npm_and_yarn/frontend/tailwindcss/postcss-4.2.2 chore(deps-dev): bump @tailwindcss/postcss from 4.1.16 to 4.2.2 in /frontend	2026-04-15 13:10:30 +05:30
dependabot[bot]	e3d36b9e52	chore(deps-dev): bump @tailwindcss/postcss in /frontend Bumps [@tailwindcss/postcss](https://github.com/tailwindlabs/tailwindcss/tree/HEAD/packages/@tailwindcss-postcss) from 4.1.16 to 4.2.2. - [Release notes](https://github.com/tailwindlabs/tailwindcss/releases) - [Changelog](https://github.com/tailwindlabs/tailwindcss/blob/main/CHANGELOG.md) - [Commits](https://github.com/tailwindlabs/tailwindcss/commits/v4.2.2/packages/@tailwindcss-postcss) --- updated-dependencies: - dependency-name: "@tailwindcss/postcss" dependency-version: 4.2.2 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 07:39:25 +00:00
Manish Madan	8950e11208	Merge pull request #2388 from arc53/dependabot/npm_and_yarn/frontend/tailwindcss-4.2.2 chore(deps-dev): bump tailwindcss from 4.2.1 to 4.2.2 in /frontend	2026-04-15 12:39:31 +05:30
dependabot[bot]	5de0132a65	chore(deps-dev): bump tailwindcss from 4.2.1 to 4.2.2 in /frontend Bumps [tailwindcss](https://github.com/tailwindlabs/tailwindcss/tree/HEAD/packages/tailwindcss) from 4.2.1 to 4.2.2. - [Release notes](https://github.com/tailwindlabs/tailwindcss/releases) - [Changelog](https://github.com/tailwindlabs/tailwindcss/blob/main/CHANGELOG.md) - [Commits](https://github.com/tailwindlabs/tailwindcss/commits/v4.2.2/packages/tailwindcss) --- updated-dependencies: - dependency-name: tailwindcss dependency-version: 4.2.2 dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 07:06:26 +00:00
Manish Madan	b92ca91512	Merge pull request #2387 from arc53/dependabot/npm_and_yarn/frontend/mermaid-11.14.0 chore(deps): bump mermaid from 11.13.0 to 11.14.0 in /frontend	2026-04-15 12:34:28 +05:30
dependabot[bot]	8e0b2844a2	chore(deps): bump mermaid from 11.13.0 to 11.14.0 in /frontend Bumps [mermaid](https://github.com/mermaid-js/mermaid) from 11.13.0 to 11.14.0. - [Release notes](https://github.com/mermaid-js/mermaid/releases) - [Commits](https://github.com/mermaid-js/mermaid/compare/mermaid@11.13.0...mermaid@11.14.0) --- updated-dependencies: - dependency-name: mermaid dependency-version: 11.14.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 07:02:28 +00:00
dependabot[bot]	0969db5e30	chore(deps-dev): bump @babel/preset-env in /extensions/react-widget Bumps [@babel/preset-env](https://github.com/babel/babel/tree/HEAD/packages/babel-preset-env) from 7.24.6 to 7.29.2. - [Release notes](https://github.com/babel/babel/releases) - [Changelog](https://github.com/babel/babel/blob/main/CHANGELOG.md) - [Commits](https://github.com/babel/babel/commits/v7.29.2/packages/babel-preset-env) --- updated-dependencies: - dependency-name: "@babel/preset-env" dependency-version: 7.29.2 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-15 05:30:32 +00:00
Manish Madan	af335a27e8	Merge pull request #2309 from arc53/dependabot/npm_and_yarn/extensions/react-widget/babel/core-7.29.0 chore(deps-dev): bump @babel/core from 7.24.6 to 7.29.0 in /extensions/react-widget	2026-04-15 10:59:07 +05:30
dependabot[bot]	0ae3139284	chore(deps-dev): bump @babel/core in /extensions/react-widget Bumps [@babel/core](https://github.com/babel/babel/tree/HEAD/packages/babel-core) from 7.24.6 to 7.29.0. - [Release notes](https://github.com/babel/babel/releases) - [Changelog](https://github.com/babel/babel/blob/main/CHANGELOG.md) - [Commits](https://github.com/babel/babel/commits/v7.29.0/packages/babel-core) --- updated-dependencies: - dependency-name: "@babel/core" dependency-version: 7.29.0 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 23:05:51 +00:00
Manish Madan	7529ca3dd6	Merge pull request #2389 from arc53/dependabot/pip/application/pip-3344959f9f chore(deps): bump cryptography from 46.0.6 to 46.0.7 in /application in the pip group across 1 directory	2026-04-15 04:32:07 +05:30
dependabot[bot]	1b813320f1	chore(deps): bump cryptography Bumps the pip group with 1 update in the /application directory: [cryptography](https://github.com/pyca/cryptography). Updates `cryptography` from 46.0.6 to 46.0.7 - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/46.0.6...46.0.7) --- updated-dependencies: - dependency-name: cryptography dependency-version: 46.0.7 dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 21:46:13 +00:00
Manish Madan	02012e9a0b	Merge pull request #2366 from arc53/dependabot/pip/application/tzdata-2026.1 chore(deps): bump tzdata from 2025.3 to 2026.1 in /application	2026-04-15 03:14:25 +05:30
dependabot[bot]	c2f027265a	chore(deps): bump tzdata from 2025.3 to 2026.1 in /application Bumps [tzdata](https://github.com/python/tzdata) from 2025.3 to 2026.1. - [Release notes](https://github.com/python/tzdata/releases) - [Changelog](https://github.com/python/tzdata/blob/master/NEWS.md) - [Commits](https://github.com/python/tzdata/compare/2025.3...2026.1) --- updated-dependencies: - dependency-name: tzdata dependency-version: '2026.1' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 21:30:10 +00:00
Manish Madan	0ae615c10e	Merge pull request #2365 from arc53/dependabot/pip/application/langchain-core-1.2.26 chore(deps): bump langchain-core from 1.2.23 to 1.2.26 in /application	2026-04-15 02:58:36 +05:30
dependabot[bot]	881d0da344	chore(deps): bump langchain-core from 1.2.23 to 1.2.26 in /application Bumps [langchain-core](https://github.com/langchain-ai/langchain) from 1.2.23 to 1.2.26. - [Release notes](https://github.com/langchain-ai/langchain/releases) - [Commits](https://github.com/langchain-ai/langchain/compare/langchain-core==1.2.23...langchain-core==1.2.26) --- updated-dependencies: - dependency-name: langchain-core dependency-version: 1.2.26 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 21:10:32 +00:00
Manish Madan	1376de6bae	Merge pull request #2364 from arc53/dependabot/pip/application/langsmith-0.7.26 chore(deps): bump langsmith from 0.7.23 to 0.7.26 in /application	2026-04-15 02:39:17 +05:30
dependabot[bot]	362ebfcc0a	chore(deps): bump langsmith from 0.7.23 to 0.7.26 in /application Bumps [langsmith](https://github.com/langchain-ai/langsmith-sdk) from 0.7.23 to 0.7.26. - [Release notes](https://github.com/langchain-ai/langsmith-sdk/releases) - [Commits](https://github.com/langchain-ai/langsmith-sdk/compare/v0.7.23...v0.7.26) --- updated-dependencies: - dependency-name: langsmith dependency-version: 0.7.26 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 20:29:55 +00:00
Manish Madan	bc77eed3d8	Merge pull request #2363 from arc53/dependabot/pip/application/jsonpointer-3.1.1 chore(deps): bump jsonpointer from 3.0.0 to 3.1.1 in /application	2026-04-15 01:58:23 +05:30
dependabot[bot]	1f346588e7	chore(deps): bump jsonpointer from 3.0.0 to 3.1.1 in /application Bumps [jsonpointer](https://github.com/stefankoegl/python-json-pointer) from 3.0.0 to 3.1.1. - [Commits](https://github.com/stefankoegl/python-json-pointer/compare/v3.0.0...v3.1.1) --- updated-dependencies: - dependency-name: jsonpointer dependency-version: 3.1.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-04-14 20:13:38 +00:00
Alex	2fed5c882b	Merge pull request #2383 from arc53/codex/add-zizmor-ci-configuration Add GitHub Actions zizmor security workflow	2026-04-14 18:02:18 +01:00