Graphlit Changelog
DocumentationDeveloper PortalMore Information
  • 🐰May 2025
    • May 11: Support for Amazon Bedrock models, McPoogle MCP search engine, bug fixes
  • 🐰April 2025
    • April 26: Support for OpenAI image generation, GPT-4.1, o3 and o4-mini models, bug fixes
    • April 13: Support for memory, email thread collections, Groq Llama 4 models, bug fixes
  • 🍀March 2025
    • March 27: Support for Twitter/X feed, Gemini 2.5 Pro model
    • March 15: Support for Podscan search, image similarity search, 'exists' and 'upsert' mutations
    • March 13: Support for classification workflow, notifications, Cohere Command A model, bug fixes
    • March 6: Support for MCP Server, Mistral OCR, retrieveSources, GPT-4.5, Sonnet 3.7, bug fixes
  • 💌February 2025
    • February 16: Support for Trello feed, Assembly.AI audio transcription, OpenAI o3-mini, bug fixes
  • 🎆January 2025
    • January 30: Support for Uppy file uploader, Deepseek Reasoner model, bug fixes
    • January 19: Support for at-cost LLM token pricing, multi-tenant feed deletion, bug fixes
    • January 10: Support for conversation message images, email filtering, Diffbot API key, bug fixes
    • January 4: Support for askGraphlit mutation, storage policies, bug fixes
  • 🎄December 2024
    • December 27: Support for LLM fallbacks, native Google Docs formats, website unblocking, bug fixes
    • December 22: Support for Dropbox, Box, Intercom and Zendesk feeds, OpenAI o1, Gemini 2.0, bug fixes
    • December 9: Support for website mapping, web page screenshots, Groq Llama 3.3 model, bug fixes
    • December 1: Support for retrieval-only RAG pipeline, bug fixes
  • 🦃November 2024
    • November 24: Support for direct LLM prompt, multi-turn image analysis, bug fixes
    • November 16: Support for image description, multi-turn text summarization
    • November 10: Support for web search, multi-turn content summarization, Deepgram language detection
    • November 4: Support for Anthropic Claude 3.5 Haiku, bug fixes
  • 🎃October 2024
    • October 31: Support for simulated tool calling, bug fixes
    • October 22: Support for latest Anthropic Sonnet 3.5 model, Cohere image embeddings
    • October 21: Support OpenAI, Cohere, Jina, Mistral, Voyage and Google AI embedding models
    • October 9: Support for GitHub repository feeds, bug fixes
    • October 7: Support for Anthropic and Gemini tool calling
    • October 3: Support tool calling, ingestBatch mutation, Gemini Flash 1.5 8b, bug fixes
  • 🎒September 2024
    • September 30: Support for Azure AI Inference models, Mistral Pixtral and latest Google Gemini models
    • September 26: Support for Google AI and Cerebras models, and latest Groq models
    • September 3: Support for web search feeds, model deprecations
    • September 1: Support for FHIR enrichment, latest Cohere models, bug fixes
  • 🎂August 2024
    • August 20: Support for medical entities, Anthropic prompt caching, bug fixes
    • August 11: Support for Azure AI Document Intelligence by default, language-aware summaries
    • August 8: Support for LLM-based document extraction, .NET SDK, bug fixes
  • ☀️July 2024
    • July 28: Support for indexing workflow stage, Azure AI language detection, bug fixes
    • July 25: Support for Mistral Large 2 & Nemo, Groq Llama 3.1 models, bug fixes
    • July 19: Support for OpenAI GPT-4o Mini, BYO-key for Azure AI, similarity by summary, bug fixes
    • July 4: Support for webhook Alerts, keywords summarization, Deepseek 128k context window, bug fixes
  • 🎓June 2024
    • June 21: Support for the Claude 3.5 Sonnet model, knowledge graph semantic search, and bug fixes
    • June 9: Support for Deepseek models, JSON-LD webpage parsing, performance improvements and bug fixes
  • 💐May 2024
    • May 15: Support for GraphRAG, OpenAI GPT-4o model, performance improvements and bug fixes
    • May 5: Support for Jina and Pongo rerankers, Microsoft Teams feed, new YouTube downloader, bug fixes
  • 🐇April 2024
    • April 23: Support for Python and TypeScript SDKs, latest OpenAI, Cohere & Groq models, bug fixes
    • April 7: Support for Discord feeds, Cohere reranking, section-aware chunking and retrieval
  • 🍀March 2024
    • March 23: Support for Linear, GitHub Issues and Jira issue feeds, ingest files via Web feed sitemap
    • March 13: Support for Claude 3 Haiku model, direct ingestion of Base64 encoded files
    • March 10: Support for Claude 3, Mistral and Groq models, usage/credits telemetry, bug fixes
  • 🌧️February 2024
    • February 21: Support for OneDrive and Google Drive feeds, extract images from PDFs, bug fixes
    • February 2: Support for Semantic Alerts, OpenAI 0125 models, performance enhancements, bug fixes
  • 🎆January 2024
    • January 22: Support for Google and Microsoft email feeds, reingest content in-place, bug fixes
    • January 18: Support for content publishing, LLM tools, CLIP image embeddings, bug fixes
  • 🎄December 2023
    • December 10: Support for OpenAI GPT-4 Turbo, Llama 2 and Mistral models; query by example, bug fixes
  • 🎃October 2023
    • October 30: Optimized conversation responses; added observable aliases; bug fixes
    • October 15: Support for Anthropic Claude models, Slack feeds and entity enrichment
  • 🛠️September 2023
    • September 24: Support for YouTube feeds; added documentation; bug fixes
    • September 20: Paid subscription plans; support for custom observed entities & Azure OpenAI GPT-4
    • September 4: Workflow configuration; support for Notion feeds; document OCR
  • 🎂August 2023
    • August 17: Prepare for usage-based billing; append SAS tokens to URIs
    • August 9: Support direct text, Markdown and HTML ingestion; new Specification LLM strategy
    • August 3: New data model for Observations, new Category entity
  • 🎇July 2023
    • July 15: Support for SharePoint feeds, new Conversation features
  • Graphlit Platform
    • Data API Changelog
Powered by GitBook
On this page
  • New Features
  • Bugs Fixed
  1. April 2024

April 7: Support for Discord feeds, Cohere reranking, section-aware chunking and retrieval

PreviousApril 23: Support for Python and TypeScript SDKs, latest OpenAI, Cohere & Groq models, bug fixesNextMarch 23: Support for Linear, GitHub Issues and Jira issue feeds, ingest files via Web feed sitemap

Last updated 1 year ago

New Features

  • Graphlit now supports Discord feeds. By connecting to a Discord channel and providing a bot token, you can ingest all Discord messages and file attachments.

  • Graphlit now supports after content retrieval in RAG pipeline. You can optionally use the Cohere rerank model to semantically rerank the semantic search results, before providing as context to the LLM.

  • Added support for section-aware text chunking and retrieval. Now, when using section-aware document preparation, such as Azure AI Document Intelligence, Graphlit will store the extracted text according to the semantic chunks (i.e. sections). The text for each section will be individually chunked and embedded into the vector index.

  • Added support for retrievalStrategy in Specification type. Graphlit now supports CHUNK, SECTION and CONTENT retrieval strategies. Chunk retrieval will use the search hit chunk, section retrieval will expand the search hit chunk to the containing section (or page, if not using section-aware preparation). Content retrieval will expand the search hit chunk to the text of the entire document.

  • Added support for rerankingStrategy in Specification type. You can now configure the reranking of content sources, using the Cohere reranking model, by assigning serviceType to COHERE. More reranking models are planned for the future.

  • Added isSynchronous flag to content ingestion mutations, such as ingestUri, so the mutation will wait for the content to complete the ingestion workflow (or error) before returning. This is useful for utilizing the API in a Jupyter notebook or Streamlit application, in a synchronous manner without polling.

  • Added includeAttachments flag to SlackFeedProperties. When enabled, Graphlit will automatically ingest any attachments within Slack messages.

  • Added ingestUri mutation to replace the now deprecated ingestPage and ingestFile mutations. We had seen confusion on when to use one vs the other, and now for any URI, whether it is a web page or hosted PDF, you can pass it to ingestUri, and we will infer the correct content ingestion workflow.

  • Removed includeSummaries from the ConversationStrategyInput type. This will re-added in the future as part of the retrieval strategy.

  • Deprecated enableExpandedRetrieval in ConversationStrategyInput type. This is now handled by setting strategyType to SECTION or CONTENT in the RetrievalStrategyInput type.

  • Moved contentLimit from ConversationStrategyInput type to RetrievalStrategyInput type. You can optionally assign the contentLimit to retrievalStrategy which limits the number of content sources leveraged in the LLM prompt context. (Default is 100.)

Bugs Fixed

  • GPLA-2469: Failed to ingest PDF hosted on GitHub

  • GPLA-2390: Claude 3 Haiku not adhering to JSON schema

  • GPLA-2474: Prompt rewriting should ignore formatting instructions in prompt

  • GPLA-2462: Missing line break after table rows

  • GPLA-2417: Not extracting images from PPTX correctly

🐇
💡
💡
⚡
⚡
⚡
⚡
Cohere reranking