Releases: assafelovic/gpt-researcher
v3.2.4
Many bug fixes and performance improvements. We've finally added a research history sidebar to the research tasks! Thank you and welcome to all new contributors!
What's Changed
- fix: unblock web socket message queue by @ewgdg in #1220
- perf: load balance nodriver browser scraper tabs by @ewgdg in #1219
- Fix: image relevance filtering by @czakop in #1215
- fix: cancel long running task gracefully by @ewgdg in #1221
- polish npm run build while retaining domain filter setting by @ElishaKay in #1224
- fix: add langchain-openai to project dependencies (to be safe on edge cases) by @milosz-l in #1227
- feature: improve JSON parsing in multi agent setting (#1137) by @czakop in #1230
- fix: resolve failing test in tests/test_logging_output.py by @grapestore in #1238
- fix: restrict Python version range in pyproject.toml by @grapestore in #1236
- New Sidebar feature of recent Research History by @assafelovic in #1240
- Fix KeyError: 'deep' in cli.py by @hd365247 in #1245
New Contributors
- @milosz-l made their first contribution in #1227
- @grapestore made their first contribution in #1238
- @hd365247 made their first contribution in #1245
Full Changelog: v3.2.3...v3.2.4
SimpleQA Evals and Deep Research 2.0
Another exciting week with so many improvements from our amazing community. We're thrilled to announce the latest release of GPT Researcher, now featuring evaluations using the SimpleQA dataset by OpenAI. Our rigorous testing has demonstrated an impressive 93% accuracy rate, surpassing all current leading projects in the market.
This achievement underscores the remarkable capabilities of the open-source community, and we're just getting started! In response to extensive feedback, we've refined our deep research functionalities to be faster, smarter, and more cost-effective, while also addressing previous bugs. Update to the latest version and experience the enhancements firsthand!
Here are the results of our latest eval run:
Evaluation Summary
Debug counts:
Total successful: 100
CORRECT: 93
INCORRECT: 7
NOT_ATTEMPTED: 1
{
  "correct_rate": 0.93,
  "incorrect_rate": 0.07,
  "not_attempted_rate": 0.01,
  "answer_rate": 0.99,
  "accuracy": 0.9292929292929293,
  "f1": 0.9246231155778895
}
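For readers who want to see how summary fields like these relate to raw grade counts, here is a minimal sketch. It assumes SimpleQA-style definitions (accuracy measured only over attempted questions, and F1 as the harmonic mean of the overall correct rate and that accuracy). The function name `simpleqa_metrics` and the counts are hypothetical, and the project's own evaluator may compute things differently.

```python
def simpleqa_metrics(correct: int, incorrect: int, not_attempted: int) -> dict:
    """Aggregate graded answers into SimpleQA-style summary rates (illustrative)."""
    total = correct + incorrect + not_attempted
    attempted = correct + incorrect

    correct_rate = correct / total
    # Accuracy is measured only over the questions that were attempted.
    accuracy = correct / attempted if attempted else 0.0
    # F1 is the harmonic mean of the overall correct rate and that accuracy.
    denom = accuracy + correct_rate
    f1 = 2 * accuracy * correct_rate / denom if denom else 0.0

    return {
        "correct_rate": correct_rate,
        "incorrect_rate": incorrect / total,
        "not_attempted_rate": not_attempted / total,
        "answer_rate": attempted / total,
        "accuracy": accuracy,
        "f1": f1,
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = simpleqa_metrics(correct=8, incorrect=1, not_attempted=1)
print(metrics)
```

Because F1 blends the two rates, declining to answer lowers the score less than answering incorrectly does.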
What's Changed
- Fix `Key Error` while using Deep Research by @kongacute in #1188
- Update requirements.txt with missing langgraph dep by @namin in #1189
- Fix Docker Build Failure: Updated `combined_query` in `DeepRsearchSkill.run()` to Handle Backslashes in F-Strings by @monolok in #1192
- stabilize docker & frontend upgrades by @ElishaKay in #1191
- Improved overall planning and research performance by @assafelovic in #1195
- Added support for base_url param in create_chat_completions for OpenAI Provider by @gaurav3247 in #1198
- Update llm.py by @olipayne in #1200
- Fix WebSocket timeout issues by @luislofer89 in #1203
- fix: Add missing langgraph module to requirements.txt by @hurxxxx in #1207
- Refactor: typing cleanup by @czakop in #1187
- add async nodriver scraper by @ewgdg in #1170
- Add language requirement to resource report prompt by @hurxxxx in #1208
- Feature:eval metrics by @kga245 in #1183
- README for feat(evals): Add SimpleQA evaluation framework and initial results by @kga245 in #1212
- Polish up loose ends based on feedback by @ElishaKay in #1211
New Contributors
- @namin made their first contribution in #1189
- @olipayne made their first contribution in #1200
- @luislofer89 made their first contribution in #1203
- @hurxxxx made their first contribution in #1207
- @czakop made their first contribution in #1187
Full Changelog: v3.2.2...v3.2.3
Deep Research now integrated with GPTR! 🎉
Wow, what a major release this is! We're so excited to release our very own version of Deep Research! This release also includes many performance improvements, including a new integration with FireCrawl for production scraping!
🎉 Announcing Deep Research for GPT Researcher
We're thrilled to announce the release of Deep Research - a powerful new capability for GPT Researcher that takes autonomous research to the next level!
What is Deep Research?
Deep Research is an advanced recursive research system that explores topics with unprecedented depth and breadth using a tree-like exploration pattern. Think of it as deploying a team of specialized AI researchers, each diving deep into different aspects of your topic while maintaining a cohesive understanding of the bigger picture.
Key Features
- 🌳 Tree-like Exploration: Configurable depth and breadth parameters let you control how wide and deep the research goes
- ⚡️ Concurrent Processing: Multiple research paths are explored simultaneously for faster results
- 🤝 Smart Context Management: Automatic aggregation and synthesis of findings across all research branches
- 📊 Real-time Progress Tracking: Monitor your research progress across both breadth and depth dimensions
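To make the tree-like exploration pattern concrete, here is a minimal, self-contained sketch. It is not GPT Researcher's implementation: `fake_subqueries`, `explore`, and `run_tree` are hypothetical names, and the search and LLM steps are replaced by stand-ins. It only illustrates how breadth and depth parameters shape a recursive, concurrent exploration whose findings are aggregated at the end.

```python
import asyncio


async def fake_subqueries(query: str, breadth: int) -> list[str]:
    # Hypothetical stand-in for an LLM call that proposes follow-up queries.
    return [f"{query} / angle {i + 1}" for i in range(breadth)]


async def explore(query: str, breadth: int, depth: int,
                  sem: asyncio.Semaphore) -> list[str]:
    """Collect findings for `query` and, recursively, for its sub-queries."""
    async with sem:  # cap how many queries are being worked on at once
        await asyncio.sleep(0)  # stand-in for searching and summarizing
        findings = [f"finding for: {query}"]
        subqueries = await fake_subqueries(query, breadth) if depth > 0 else []

    # Recurse after releasing the semaphore so parents never block children.
    branches = await asyncio.gather(
        *(explore(q, breadth, depth - 1, sem) for q in subqueries)
    )
    for branch in branches:
        findings.extend(branch)  # aggregate results from every branch
    return findings


def run_tree(query: str, breadth: int = 2, depth: int = 2,
             concurrency: int = 2) -> list[str]:
    async def _main() -> list[str]:
        sem = asyncio.Semaphore(concurrency)
        return await explore(query, breadth, depth, sem)

    return asyncio.run(_main())


results = run_tree("solid-state batteries", breadth=2, depth=2)
print(len(results), "findings collected")
```

With a breadth of 2 and a depth of 2, the sketch visits the root query, two sub-queries, and four leaf queries, which shows how quickly the number of research paths grows as either parameter increases.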
Performance & Resources
- ⏱️ Average completion time: ~5 minutes per deep research
- 💰 Cost-effective: ~$0.4 per research using `o3-mini` on "high" reasoning effort
- 🎯 Produces comprehensive reports with multiple perspectives and deeper insights
Getting Started
Deep Research is available now in the latest version of GPT Researcher. You can enable it by:
- Using the PIP package with `report_type="deep"`
- Selecting "Deep Research" in the web interface
- Configuring custom parameters through environment variables or config files
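As a rough illustration of the first option, the sketch below passes `report_type="deep"` to the PIP package. It assumes the `gpt-researcher` package is installed, that the required API keys are configured in the environment, and that the `GPTResearcher` class exposes `conduct_research()` and `write_report()` as in the project README. The helper names `deep_research_kwargs` and `run_deep_research` are hypothetical.

```python
import asyncio


def deep_research_kwargs(query: str) -> dict:
    # Keyword arguments for the researcher; "deep" selects Deep Research.
    return {"query": query, "report_type": "deep"}


async def run_deep_research(query: str) -> str:
    # Imported lazily so the sketch can be read without the package installed.
    from gpt_researcher import GPTResearcher

    researcher = GPTResearcher(**deep_research_kwargs(query))
    await researcher.conduct_research()     # gather and aggregate sources
    return await researcher.write_report()  # produce the final report


# Example invocation (requires the package and API keys):
# report = asyncio.run(run_deep_research("What are the latest advances in fusion energy?"))
# print(report)
```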
Documentation
For detailed information about Deep Research, including configuration options and best practices, visit our Deep Research documentation.
Try it out today and experience a new level of autonomous research capability!
What's Changed
- Fix: missing temperature parameter (#1154) by @DaisukeUra in #1155
- Update Dependencies to Fix OpenAI API Parameter Error by @Jones0073 in #1161
- Update docker doc to add langchain packages by @azertylr in #1167
- Fixed construct subtopics by @vinayvikram in #1172
- feat: Add BSHTMLLoader support and enhance error handling for document loading by @LavX in #1166
- Enhanced incorrect json format handling at agent_creator.py by @johnmalek312 in #1168
- Added the Azure-storage option for document sources. by @Shivam-19agg in #1165
- Filter by domain[s] by @ElishaKay in #1157
- Fix: ArxivScraper causing error when scraping the link. by @new-player in #1178
- Deep Research logic to enhance GPT Researcher capabilities by @assafelovic in #1179
- Add FireCrawl scraper by @kongacute in #1177
New Contributors
- @DaisukeUra made their first contribution in #1155
- @Jones0073 made their first contribution in #1161
- @azertylr made their first contribution in #1167
- @vinayvikram made their first contribution in #1172
- @LavX made their first contribution in #1166
- @johnmalek312 made their first contribution in #1168
- @Shivam-19agg made their first contribution in #1165
- @new-player made their first contribution in #1178
- @kongacute made their first contribution in #1177
Full Changelog: v3.2.1...v3.2.2
Join our Discord community to share your experience and feedback. :-)
v3.2.1
With the latest Deep Research hype - which, as you all know, is what GPTR has been all about for two years already :) - we've added support for reasoning models across the entire research workflow, including support for deeper research. You can now easily configure GPTR through env vars for a Deep Research experience!
What's Changed
- Update README.md for multi-agents by @danieldekay in #1138
- Fix scraper browser returns duplicate texts by @ewgdg in #1134
- refactor: get text from soup by @ewgdg in #1143
- fixed polling param for ubuntu host by @kga245 in #1151
- added helpful optional configs for dockerized hosting by @kga245 in #1150
- Adds support for o3-mini by @regismesquita in #1144
New Contributors
- @ewgdg made their first contribution in #1134
- @regismesquita made their first contribution in #1144
Full Changelog: v3.2.0...v3.2.1
v3.2.0
This release focuses on performance improvements and bug fixes, making GPT Researcher more powerful than before. With the release of Deep Research, we've been heads-down fixing bugs and improving performance to sustain our position as the leading open deep research project. Thanks and welcome to all the new contributors who helped push GPTR further!
What's Changed
- GPTR api url, npm package & other polishes by @ElishaKay in #1093
- fix(#1100): apply language config to Introduction and Conclusion sections by @DAAE-Na-Yunchae in #1101
- feature/support GigaChat LLM by @Rai220 in #1095
- Update Poetry to supported Python version by @chrishart0 in #1107
- Update tavily_search.py so TAVILY missing error no longer occurs by @cnukaus in #1113
- Add google_genai:text-embedding-004 by @gmh5225 in #1112
- fix: add language param to generate_outline_report_prompt by @fibonacci998 in #1109
- fix: update broken link in README by @fibonacci998 in #1117
- Fixed human in the loop by @electricjimi in #1116
- Fixed broken configuration link in llms.md by @Ajacmac in #1121
- feat: enhance scraper logging and title handling by @kga245 in #1118
- fix: update directory nextjs example in langgraph.md by @fibonacci998 in #1123
New Contributors
- @DAAE-Na-Yunchae made their first contribution in #1101
- @Rai220 made their first contribution in #1095
- @chrishart0 made their first contribution in #1107
- @cnukaus made their first contribution in #1113
- @gmh5225 made their first contribution in #1112
- @fibonacci998 made their first contribution in #1109
- @electricjimi made their first contribution in #1116
- @Ajacmac made their first contribution in #1121
Full Changelog: v3.1.9...v3.2.0
v3.1.9
What's Changed
- preserve user-specified report source instead of config default by @alesaccoia in #1070
- multi_agent: update CSS path to be relative to current file by @alesaccoia in #1071
- Logging update. Fixed regression and update to Docker log handling. by @kga245 in #1066
- Feature/add cursor rules, procfile addition by @kga245 in #1072
- Feature/add tone argument by @kga245 in #1074
- Update llms.md for Azure OpenAI by @danieldekay in #1083
- data ingestion docs and discord bot by @ElishaKay in #1075
- docs and docker fixes by @ElishaKay in #1085
New Contributors
- @alesaccoia made their first contribution in #1070
Full Changelog: v3.1.8...v3.1.9
v3.1.8
This release adds support for LiteLLM as a generic LLM provider. In addition, we now support DeepSeek!
What's Changed
- Add DeepSeek to SUPPORTED_PROVIDERS by @winsonluk in #1056
- add parameter to fix resource execution by @Italosayan in #1053
- Add Litellm to supported providers in GenericLLMProvider by @moritalous in #1060
- Remove server log.txt by @kga245 in #1059
New Contributors
- @Italosayan made their first contribution in #1053
Full Changelog: v.3.1.7...v3.1.8
v.3.1.7
This week brings another round of major performance improvements from our amazing community. Thanks to @kga245 for the awesome contribution of rebuilding the entire log handler for more robust capabilities. We've also added support for Tavily Extract, which solves scaling for developers shipping GPT Researcher in production. Tavily Extract lets you scrape any site at scale without getting blocked by site providers or dealing with the hassle of proxies. Thank you to the best community on GitHub; we look forward to your next contributions. And, as always, welcome to our new contributors!
What's Changed
- Prints unexpected Google retriever search API response by @ivarprudnikov in #1036
- Logs handler improvements by @kga245 in #1038
- Added tutorial playlist to documentation by @gaurav3247 in #1042
- Bug fixes to unified logs by @kga245 in #1039
- Support Bedrock embedding model by @moritalous in #1045
- Update hybrid_research.md by @MC-shark in #1046
- 【Added】Enable support for processing online documents in hybrid mode by @MC-shark in #1043
- Adding Tavily extract by @pulvedu in #1044
- Fix Gemini bug when system role is provided by @winsonluk in #1048
New Contributors
- @ivarprudnikov made their first contribution in #1036
- @gaurav3247 made their first contribution in #1042
- @moritalous made their first contribution in #1045
- @pulvedu made their first contribution in #1044
Full Changelog: v3.1.6...v.3.1.7
v3.1.6
Another release of performance improvements and bug fixes. Welcome @kga245 as a new contributor with a super valuable contribution and special shoutout again to @winsonluk for helping improve GPTR's stability!
What's Changed
- Fix xAI bug by @winsonluk in #1021
- added language support by @assafelovic in #1026
- fix: Add language parameter support to generate_subtopic_report_prompt by @onlydole in #1029
- Fix generate_custom_report_prompt language bug by @winsonluk in #1033
- Introduce strategic_token_limit to fix Anthropic bug by @winsonluk in #1032
- Feature: unified logs handler by @kga245 in #1031
New Contributors
Full Changelog: v3.1.5...v3.1.6
v3.1.5
Welcome to all the new contributors and special shoutout to @winsonluk! This release includes critical bug fixes and improvements to report quality such as ranking sources and content before generation. This release is recommended for all!
What's Changed
- Adding a ranking step of research sources by @assafelovic in #992
- UX Fixes by @ElishaKay in #994
- Fix max_results for Google Search by @winsonluk in #1009
- Add xAI by @winsonluk in #1017
- Fix uncaught exception in Google retrieval by @winsonluk in #1018
- Default to the retriever set in config rather than just tavily by @winsonluk in #1019
- 【feature】Commit Message: Optimized PyMuPDFScraper to handle invalid o… by @MC-shark in #1012
- Add support for DashScope provider in LLM and embeddings modules by @AStupidBear in #1013
- Fix AWS Bedrock invocation; create output folder in cli.py if it doesn't exist by @ai-1st in #1014
New Contributors
- @winsonluk made their first contribution in #1009
- @MC-shark made their first contribution in #1012
- @AStupidBear made their first contribution in #1013
- @ai-1st made their first contribution in #1014
Full Changelog: v3.1.4...v3.1.5