ManishMadan2882
|
bbce872ac5
|
(fix:chunker) combine metadata as well
|
2025-07-04 02:19:58 +05:30 |
|
ManishMadan2882
|
0f7ebcd8e4
|
(feat:dir-reader) store mime types, file size in db
|
2025-07-03 18:09:19 +05:30 |
|
ManishMadan2882
|
82fc19e7b7
|
(fix:dir-reader) conflict of same filename in dir
|
2025-07-03 17:28:12 +05:30 |
|
ManishMadan2882
|
2ef23fe1b3
|
(feat:dir-reader) maintain dir structure in db
|
2025-07-03 01:24:22 +05:30 |
|
ManishMadan2882
|
fd905b1a06
|
(feat:dir-reader) save tokens with filenames
|
2025-07-02 16:30:29 +05:30 |
|
ManishMadan2882
|
e1aa2cc0b8
|
(fix:ingestion) store file name as metadata, not path
|
2025-05-09 02:26:35 +05:30 |
|
Alex
|
481df4d604
|
fix: enhance error logging with exception info across multiple modules
|
2025-05-05 13:12:39 +01:00 |
|
Pavel
|
57a6fb31b2
|
periodic header injection
|
2025-03-31 22:28:04 +04:00 |
|
asminkarki012
|
c70be12bfd
|
fix[csv_parser]:missing header
|
2025-03-28 22:46:11 +05:45 |
|
Alex
|
d47232246a
|
fix: remove old pypdf
|
2025-02-06 19:59:42 +00:00 |
|
Pavel
|
fddee69f92
|
web loader fix
Changes web loader to the correct output.
|
2025-01-17 19:13:23 +03:00 |
|
Pavel
|
13fcbe3e74
|
scraper with markdownify
|
2025-01-15 01:08:09 +03:00 |
|
Alex
|
41b4c28430
|
fix: linting
|
2024-12-23 17:41:44 +00:00 |
|
Pavel
|
b41a989051
|
test version
|
2024-12-23 16:59:27 +00:00 |
|
GH Action - Upstream Sync
|
628f83172a
|
Merge branch 'main' of https://github.com/arc53/DocsGPT
|
2024-11-22 01:25:17 +00:00 |
|
Alex
|
a0a05b676f
|
Merge pull request #1303 from jayantp2003/bugfix/859-large-zip-breaking-stream-endpoint
Bugfix/859 large zip breaking stream endpoint
|
2024-11-21 17:34:21 +00:00 |
|
Alex
|
f65ecb9a0f
|
fix: lint import
|
2024-11-19 19:16:24 +00:00 |
|
Alex
|
312cb9ae70
|
feat: image parser
|
2024-11-19 19:06:53 +00:00 |
|
Alex
|
63b547ea13
|
fix: delete old files
|
2024-11-17 12:59:34 +00:00 |
|
Alex
|
2245f4690e
|
fix: reddit loader validation
|
2024-11-15 11:02:27 +00:00 |
|
JeevaRamanathan M
|
5c756348a5
|
feat: Presentation parser implementation
Signed-off-by: JeevaRamanathan M <jeevaramanathan.m@infosys.com>
|
2024-10-31 11:47:12 +00:00 |
|
Alex
|
1c791f240a
|
Merge pull request #1377 from JeevaRamanathan/feature/file-json
feat: JSON Parser Implementation
|
2024-10-26 17:28:57 +01:00 |
|
JeevaRamanathan M
|
c77d415893
|
feat: JSON parser implementation
Signed-off-by: JeevaRamanathan M <jeevaramanathan.m@infosys.com>
|
2024-10-24 20:36:47 +00:00 |
|
devendra.parihar
|
d3238de8ab
|
fix: lint error
|
2024-10-18 12:23:17 +05:30 |
|
devendra.parihar
|
09a2705311
|
fix:GitHubLoader to Handle Binary Files
|
2024-10-18 12:08:08 +05:30 |
|
devendra.parihar
|
a4c0861cf4
|
fix:GitHubLoader to Handle Binary Files
|
2024-10-18 12:07:44 +05:30 |
|
jayantp2003
|
3db07f3a26
|
Fix #859: Resolve issue with large zip breaking stream endpoint
|
2024-10-11 17:10:12 +05:30 |
|
jayantp2003
|
a2ef45e13f
|
Fix #859: Resolve issue with large zip breaking stream endpoint
|
2024-10-11 17:08:04 +05:30 |
|
Alex
|
c9e95a9146
|
Merge pull request #1184 from Devparihar5/ExcelParser
new: added ExcelParser(tested) to read .xlsx files
|
2024-10-06 23:19:37 +01:00 |
|
Alex
|
6932c7e3e9
|
feat: add filename to the top
|
2024-10-05 21:56:47 +01:00 |
|
Alex
|
c04687fdd1
|
fix: github loader metadata clickable
|
2024-10-05 21:53:30 +01:00 |
|
Alex
|
7717242112
|
fix(lint): ruff var
|
2024-10-05 21:37:55 +01:00 |
|
Alex
|
1ad82c22d9
|
fix: headers
|
2024-10-05 21:36:04 +01:00 |
|
Alex
|
8fa88175c1
|
fix: translation + auth
|
2024-10-05 21:33:58 +01:00 |
|
Alex
|
2611550ffd
|
|
2024-10-02 23:44:29 +01:00 |
|
devendra.parihar
|
7794129929
|
new: added ExcelParser(tested) to read .xlsx files
|
2024-10-01 22:03:10 +05:30 |
|
Siddhant Rai
|
3d292aa485
|
feat: sync remote sources through celery periodic tasks
|
2024-09-25 15:20:11 +05:30 |
|
Alex
|
44d225e6ca
|
Merge branch 'main' into 1059-migrating-database-to-new-model
|
2024-09-09 23:55:25 +01:00 |
|
Alex
|
2f9c72c1cf
|
feat: migrate store to source_id
|
2024-09-09 15:46:18 +01:00 |
|
Alex
|
1bb81614a5
|
fix: metadata things
|
2024-09-09 13:37:11 +01:00 |
|
Alex
|
8166642ff9
|
fix: write id instead of old path on remote db's
|
2024-09-09 12:00:59 +01:00 |
|
Alex
|
c49b7613e0
|
fix: langchain warning
|
2024-08-31 12:53:37 +01:00 |
|
Alex
|
16aedd61da
|
fix: ruff lint
|
2024-08-12 16:37:03 +01:00 |
|
Alex
|
5a2f3ad616
|
feat: remove dep
|
2024-08-12 16:35:23 +01:00 |
|
ManishMadan2882
|
9000838aab
|
(feat:vectors): calc, add token in db
|
2024-05-24 21:10:50 +05:30 |
|
Siddhant Rai
|
53e86205ad
|
fix: added more headers from default
|
2024-05-03 18:47:30 +05:30 |
|
Siddhant Rai
|
aa670efe3a
|
fix: connection aborted in WebBaseLoader
|
2024-05-03 18:25:01 +05:30 |
|
Alex
|
8d7a134cb4
|
lint: ruff
|
2024-04-09 17:25:08 +01:00 |
|
Siddhant Rai
|
e01071426f
|
feat: field to pass number of posts as a parameter
|
2024-03-27 19:20:55 +05:30 |
|
Siddhant Rai
|
eed1bfbe50
|
feat: fields to handle reddit loader + minor changes
|
2024-03-26 16:07:44 +05:30 |
|