DocsGPT

mirror of https://github.com/arc53/DocsGPT.git synced 2026-05-15 17:03:47 +00:00

Author	SHA1	Message	Date
Alex	249f9f9fe0	perf(streaming): batch message_events INSERTs per stream complete_stream previously opened a fresh db_session() per yielded event, doing one Postgres INSERT + commit per chunk on the WSGI thread. Streaming answers emit ~100s of answer chunks per response, so the route was paying ~100 PG roundtrips per stream serialized on commit latency. New BatchedJournalWriter in application/streaming/message_journal.py accumulates rows per stream and flushes on three triggers: - size: buffer reaches 16 entries - time: 100ms elapsed since the last flush - lifecycle: close() at end-of-stream Live pubsub publishes still fire synchronously per record(), so subscribers see events in real time — only the durable journal write is amortized. On bulk INSERT IntegrityError the writer falls back to per-row record() with the existing seq+1 retry so a single colliding seq doesn't drop the rest of the batch. complete_stream wires journal_writer.close() into every exit path (happy end, tool-approval-paused end, GeneratorExit, error handler) so the terminal event is committed before the generator returns — otherwise a reconnecting client could snapshot up to the last flush boundary and live-tail waiting for an end that's still in memory. Repository gets bulk_record() — one SQLAlchemy executemany INSERT for the bulk path. All-or-nothing on collision (Postgres aborts the whole batch); the writer's per-row fallback handles recovery.	2026-05-12 18:20:19 +01:00
Alex	ed9444cf3d	feat: SSE notification system Adds a per-user SSE pipe (GET /api/events) plus a per-message chat-stream reconnect endpoint (GET /api/messages/<id>/events). Backend substrate: - application/events/ — durable journal (Redis Streams) + live pub/sub for user-scoped events, with publish_user_event() as the worker-side entrypoint. - application/streaming/ — broadcast_channel for pub/sub fanout and event_replay for the per-message snapshot+tail path. - application/storage/db/repositories/message_events.py + alembic 0007 — Postgres journal for chat-stream events. - application/worker.py — ingest/reingest/remote/connector/ attachment/mcp_oauth tasks publish queued/progress/completed/ failed envelopes alongside their existing status updates. Frontend client: - frontend/src/events/ — connect/reconnect, Last-Event-ID cursor, backoff with jitter. Each tab runs its own connection; no cross-tab dedup (future work). - frontend/src/notifications/ — recentEvents ring, cursor tracking, tool-approval toast. - frontend/src/upload/uploadSlice.ts — extraReducers for source.ingest.* and attachment.* events. Coverage: 132 SSE tests across events substrate, replay, journal, routes, and worker publishes.	2026-05-12 14:29:45 +01:00
Alex	b4c4ab68f0	feat: durability and idempotency keys (#2450 ) * feat: durability and idempotency keys * feat: more durable frontend * fix: tests * fix: mini issues * fix: better json validation * fix: tests	2026-05-04 23:25:41 +01:00
Alex	552bfe016a	fix: better token counting and fixes cache	2026-04-28 01:47:53 +01:00
Alex	318de18d43	feat: BYOM (#2433 )	2026-04-27 22:09:33 +01:00
Alex	c06888bc86	feat: asgi and search service (#2424 ) * feat: asgi and search service * feat: asgi and mcp tool server * fix: asgi issues * fix: mini cors hardening	2026-04-23 12:21:39 +01:00
Alex	81b6ee5daa	Pg 4 (#2390 ) * feat: postgres tests * feat: mongo cutoff * feat: mongo cutoff * feat: adjust docs and compose files * fix: mini code mongo removals * fix: tests and k8s mongo stuff * feat: test fixes * fix: ruff * fix: vale * Potential fix for pull request finding 'CodeQL / Clear-text logging of sensitive information' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * fix: mini suggestions * vale lint fix 2 * fix: codeql columns thing * fix: test mongo * fix: tests coverage * feat: better tests 4 * feat: more tests * feat: decent coverage * fix: ruff fixes * fix: remove mongo mock * feat: enhance workflow engine and API routes; add document retrieval and source handling * feat: e2e tests * fix: mcp, mongo and more * fix: mini codeql warning * fix: agent chunk view * fix: mini issues * fix: more pg fixes * feat: postgres prep on start * feat: qa tests * fix: mini improvements * fix: tests --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Siddhant Rai <siddhant.rai.5686@gmail.com>	2026-04-18 13:13:57 +01:00
Alex	0f20adcbf4	feat: pre depriciation	2026-04-14 00:19:50 +01:00
Alex	502819ae52	feat: pg migration, more tables	2026-04-12 12:15:59 +01:00
Alex	0c15af90b1	feat: history overwrite	2026-04-06 14:42:01 +01:00
Alex	d711eefe96	patch: agent usage limits	2026-04-03 18:03:31 +01:00
Alex	8b9e595d85	fix: structure improvements of messages	2026-04-01 14:58:44 +01:00
Alex	e04baa7ed8	feat: tests and approval gate	2026-04-01 12:49:32 +01:00
Alex	73256389cf	feat: client side tools	2026-03-31 22:20:55 +01:00
Alex	d609efca49	feat: continuation messages	2026-03-31 21:30:24 +01:00
Alex	f7bfd38b28	fix: proper fallback handling within agent during stream	2026-03-26 12:52:30 +00:00
Alex	72393dc369	feat: improve research	2026-03-25 17:42:24 +00:00
Alex	556b0a1da5	feat: research init	2026-03-25 15:16:18 +00:00
Alex	32c268a21e	refactor: simplify agent architecture and remove ReActAgent	2026-03-25 12:47:17 +00:00
Siddhant Rai	13ad3b5dce	feat: enhance logging and error handling across various tools; update DuckDuckGo dependency (#2282 ) Co-authored-by: Alex <a@tushynski.me>	2026-03-12 16:50:29 +00:00
Alex	5006271abb	fix stream stuff (#2293 )	2026-03-11 11:43:27 +00:00
Alex	1a2104f474	fix: token calc (#2285 )	2026-02-20 17:37:47 +00:00
Siddhant Rai	8ef321d784	feat: agent workflow builder (#2264 ) * feat: implement WorkflowAgent and GraphExecutor for workflow management and execution * refactor: workflow schemas and introduce WorkflowEngine - Updated schemas in `schemas.py` to include new agent types and configurations. - Created `WorkflowEngine` class in `workflow_engine.py` to manage workflow execution. - Enhanced `StreamProcessor` to handle workflow-related data. - Added new routes and utilities for managing workflows in the user API. - Implemented validation and serialization functions for workflows. - Established MongoDB collections and indexes for workflows and related entities. * refactor: improve WorkflowAgent documentation and update type hints in WorkflowEngine * feat: workflow builder and managing in frontend - Added new endpoints for workflows in `endpoints.ts`. - Implemented `getWorkflow`, `createWorkflow`, and `updateWorkflow` methods in `userService.ts`. - Introduced new UI components for alerts, buttons, commands, dialogs, multi-select, popovers, and selects. - Enhanced styling in `index.css` with new theme variables and animations. - Refactored modal components for better layout and styling. - Configured TypeScript paths and Vite aliases for cleaner imports. * feat: add workflow preview component and related state management - Implemented WorkflowPreview component for displaying workflow execution. - Created WorkflowPreviewSlice for managing workflow preview state, including queries and execution steps. - Added WorkflowMiniMap for visual representation of workflow nodes and their statuses. - Integrated conversation handling with the ability to fetch answers and manage query states. - Introduced reusable Sheet component for UI overlays. - Updated Redux store to include workflowPreview reducer. * feat: enhance workflow execution details and state management in WorkflowEngine and WorkflowPreview * feat: enhance workflow components with improved UI and functionality - Updated WorkflowPreview to allow text truncation for better display of long names. - Enhanced BaseNode with connectable handles and improved styling for better visibility. - Added MobileBlocker component to inform users about desktop requirements for the Workflow Builder. - Introduced PromptTextArea component for improved variable insertion and search functionality, including upstream variable extraction and context addition. * feat(workflow): add owner validation and graph version support * fix: ruff lint --------- Co-authored-by: Alex <a@tushynski.me>	2026-02-11 14:15:24 +00:00
Alex	f910a82683	feat: add unauthorized response handling in StreamResource and bump deps	2025-12-27 14:23:37 +00:00
Alex	197e94302b	Patches (#2219 ) * feat: implement URL validation to prevent SSRF * feat: add zip extraction security * ruff fixes * fix: standardize error messages across API responses	2025-12-24 18:35:57 +02:00
Alex	40c3e5568c	fix search (#2210 ) * fix search * fix ruff	2025-12-22 00:51:06 +02:00
Alex	af3e16c4fc	fix: count history tokens from chunks, remove old UI setting limit (#2196 )	2025-12-17 03:34:17 +02:00
Alex	e0a9f08632	refactor and deps (#2184 )	2025-12-10 23:53:59 +02:00
Alex	67e0d222d1	fix: model in agents via api (#2174 )	2025-11-25 13:54:34 +02:00
Alex	17698ce774	feat: context compression (#2173 ) * feat: context compression * fix: ruff	2025-11-24 12:44:19 +02:00
Siddhant Rai	3f7de867cc	feat: model registry and capabilities for multi-provider support (#2158 ) * feat: Implement model registry and capabilities for multi-provider support - Added ModelRegistry to manage available models and their capabilities. - Introduced ModelProvider enum for different LLM providers. - Created ModelCapabilities dataclass to define model features. - Implemented methods to load models based on API keys and settings. - Added utility functions for model management in model_utils.py. - Updated settings.py to include provider-specific API keys. - Refactored LLM classes (Anthropic, OpenAI, Google, etc.) to utilize new model registry. - Enhanced utility functions to handle token limits and model validation. - Improved code structure and logging for better maintainability. * feat: Add model selection feature with API integration and UI component * feat: Add model selection and default model functionality in agent management * test: Update assertions and formatting in stream processing tests * refactor(llm): Standardize model identifier to model_id * fix tests --------- Co-authored-by: Alex <a@tushynski.me>	2025-11-14 13:13:19 +02:00
Siddhant Rai	21e5c261ef	feat: template-based prompt rendering with dynamic namespace injection (#2091 ) * feat: template-based prompt rendering with dynamic namespace injection * refactor: improve template engine initialization with clearer formatting * refactor: streamline ReActAgent methods and improve content extraction logic feat: enhance error handling in NamespaceManager and TemplateEngine fix: update NewAgent component to ensure consistent form data submission test: modify tests for ReActAgent and prompt renderer to reflect method changes and improve coverage * feat: tools namespace + three-tier token budget * refactor: remove unused variable assignment in message building tests * Enhance prompt customization and tool pre-fetching functionality * ruff lint fix * refactor: cleaner error handling and reduce code clutter --------- Co-authored-by: Alex <a@tushynski.me>	2025-10-31 12:47:44 +00:00
Ali Arda Fincan	ce32dd2907	Feat: Agent Token or Request Limiting (#2041 ) * Update routes.py, added token and request limits to create/update agent operations * added usage limit check to api endpoints cannot create agents with usage limit right now that will be implemented * implemented api limiting as either token limiting or request limiting modes * minor typo & bug fix	2025-10-13 21:32:46 +03:00
Manish Madan	a4507008c1	complete_stream: Stop response streaming (#2031 ) * (feat:pause-stream) generator exit * (feat:pause-stream) close request * (feat:pause-stream) finally close; google anthropic --------- Co-authored-by: GH Action - Upstream Sync <action@github.com>	2025-10-08 20:37:30 +03:00
Alex	b910f308f2	fix: api answer tool call event	2025-09-30 14:42:54 +01:00
Siddhant Rai	adcdce8d76	fix: handle invalid chunks value in StreamProcessor and ClassicRAG	2025-09-10 22:10:11 +05:30
Siddhant Rai	b865a7aec1	Merge branch 'main' of https://github.com/siiddhantt/DocsGPT into pr/1930	2025-09-10 20:15:20 +05:30
Siddhant Rai	2f88890c94	feat: add support for multiple sources in agent configuration and update related components	2025-09-08 22:10:08 +05:30
Alex	44d21ab703	fix: passing sources and chunk if agent is shared	2025-08-22 13:36:31 +01:00
Alex	15d2d0115b	Merge branch 'main' into feat/agent-schema-response	2025-08-13 17:12:26 +01:00
Siddhant Rai	896dcf1f9e	feat: add support for structured output and JSON schema validation	2025-08-13 13:29:51 +05:30
Alex	f94a093e8c	fix: truncate long text fields to prevent overflow in logs and sources	2025-08-11 14:56:31 +01:00
Alex	092c01cae7	fix: ruff lint	2025-08-05 12:22:33 +01:00
Alex	4caff0fcf6	fix: enhance error logging for malformed request in stream route	2025-08-04 11:41:41 +01:00
Siddhant Rai	212952f3e9	fix: allow api call in stream route + get_prompt error	2025-07-25 16:17:18 +05:30
Siddhant Rai	76973a4b4c	feat: answer routes re-structure for better maintainability and reuse	2025-07-23 20:07:42 +05:30
copilot-swe-agent[bot]	2a4ec0cf5b	Fix conversation summary prompt to use user query language Co-authored-by: dartpain <15183589+dartpain@users.noreply.github.com>	2025-07-15 09:33:52 +00:00
Pavel	327ae35420	Agent docs upd 1. Added a page about interacting with agent API. 2. Added a page about interacting with agent webhooks. 3. Fixed small bug with /api/answer	2025-06-24 16:48:12 +02:00
Siddhant Rai	3353c0ee1d	Merge branch 'main' into refactor/llm-handler	2025-06-11 19:27:33 +05:30
Siddhant Rai	3351f71813	refactor: tool calls sent when pending and after completion	2025-06-11 12:40:32 +05:30

1 2 3 4

172 Commits