Graphlit Changelog
DocumentationDeveloper PortalMore Information
  • 🐰May 2025
    • May 11: Support for Amazon Bedrock models, McPoogle MCP search engine, bug fixes
  • 🐰April 2025
    • April 26: Support for OpenAI image generation, GPT-4.1, o3 and o4-mini models, bug fixes
    • April 13: Support for memory, email thread collections, Groq Llama 4 models, bug fixes
  • 🍀March 2025
    • March 27: Support for Twitter/X feed, Gemini 2.5 Pro model
    • March 15: Support for Podscan search, image similarity search, 'exists' and 'upsert' mutations
    • March 13: Support for classification workflow, notifications, Cohere Command A model, bug fixes
    • March 6: Support for MCP Server, Mistral OCR, retrieveSources, GPT-4.5, Sonnet 3.7, bug fixes
  • 💌February 2025
    • February 16: Support for Trello feed, Assembly.AI audio transcription, OpenAI o3-mini, bug fixes
  • 🎆January 2025
    • January 30: Support for Uppy file uploader, Deepseek Reasoner model, bug fixes
    • January 19: Support for at-cost LLM token pricing, multi-tenant feed deletion, bug fixes
    • January 10: Support for conversation message images, email filtering, Diffbot API key, bug fixes
    • January 4: Support for askGraphlit mutation, storage policies, bug fixes
  • 🎄December 2024
    • December 27: Support for LLM fallbacks, native Google Docs formats, website unblocking, bug fixes
    • December 22: Support for Dropbox, Box, Intercom and Zendesk feeds, OpenAI o1, Gemini 2.0, bug fixes
    • December 9: Support for website mapping, web page screenshots, Groq Llama 3.3 model, bug fixes
    • December 1: Support for retrieval-only RAG pipeline, bug fixes
  • 🦃November 2024
    • November 24: Support for direct LLM prompt, multi-turn image analysis, bug fixes
    • November 16: Support for image description, multi-turn text summarization
    • November 10: Support for web search, multi-turn content summarization, Deepgram language detection
    • November 4: Support for Anthropic Claude 3.5 Haiku, bug fixes
  • 🎃October 2024
    • October 31: Support for simulated tool calling, bug fixes
    • October 22: Support for latest Anthropic Sonnet 3.5 model, Cohere image embeddings
    • October 21: Support OpenAI, Cohere, Jina, Mistral, Voyage and Google AI embedding models
    • October 9: Support for GitHub repository feeds, bug fixes
    • October 7: Support for Anthropic and Gemini tool calling
    • October 3: Support tool calling, ingestBatch mutation, Gemini Flash 1.5 8b, bug fixes
  • 🎒September 2024
    • September 30: Support for Azure AI Inference models, Mistral Pixtral and latest Google Gemini models
    • September 26: Support for Google AI and Cerebras models, and latest Groq models
    • September 3: Support for web search feeds, model deprecations
    • September 1: Support for FHIR enrichment, latest Cohere models, bug fixes
  • 🎂August 2024
    • August 20: Support for medical entities, Anthropic prompt caching, bug fixes
    • August 11: Support for Azure AI Document Intelligence by default, language-aware summaries
    • August 8: Support for LLM-based document extraction, .NET SDK, bug fixes
  • ☀️July 2024
    • July 28: Support for indexing workflow stage, Azure AI language detection, bug fixes
    • July 25: Support for Mistral Large 2 & Nemo, Groq Llama 3.1 models, bug fixes
    • July 19: Support for OpenAI GPT-4o Mini, BYO-key for Azure AI, similarity by summary, bug fixes
    • July 4: Support for webhook Alerts, keywords summarization, Deepseek 128k context window, bug fixes
  • 🎓June 2024
    • June 21: Support for the Claude 3.5 Sonnet model, knowledge graph semantic search, and bug fixes
    • June 9: Support for Deepseek models, JSON-LD webpage parsing, performance improvements and bug fixes
  • 💐May 2024
    • May 15: Support for GraphRAG, OpenAI GPT-4o model, performance improvements and bug fixes
    • May 5: Support for Jina and Pongo rerankers, Microsoft Teams feed, new YouTube downloader, bug fixes
  • 🐇April 2024
    • April 23: Support for Python and TypeScript SDKs, latest OpenAI, Cohere & Groq models, bug fixes
    • April 7: Support for Discord feeds, Cohere reranking, section-aware chunking and retrieval
  • 🍀March 2024
    • March 23: Support for Linear, GitHub Issues and Jira issue feeds, ingest files via Web feed sitemap
    • March 13: Support for Claude 3 Haiku model, direct ingestion of Base64 encoded files
    • March 10: Support for Claude 3, Mistral and Groq models, usage/credits telemetry, bug fixes
  • 🌧️February 2024
    • February 21: Support for OneDrive and Google Drive feeds, extract images from PDFs, bug fixes
    • February 2: Support for Semantic Alerts, OpenAI 0125 models, performance enhancements, bug fixes
  • 🎆January 2024
    • January 22: Support for Google and Microsoft email feeds, reingest content in-place, bug fixes
    • January 18: Support for content publishing, LLM tools, CLIP image embeddings, bug fixes
  • 🎄December 2023
    • December 10: Support for OpenAI GPT-4 Turbo, Llama 2 and Mistral models; query by example, bug fixes
  • 🎃October 2023
    • October 30: Optimized conversation responses; added observable aliases; bug fixes
    • October 15: Support for Anthropic Claude models, Slack feeds and entity enrichment
  • 🛠️September 2023
    • September 24: Support for YouTube feeds; added documentation; bug fixes
    • September 20: Paid subscription plans; support for custom observed entities & Azure OpenAI GPT-4
    • September 4: Workflow configuration; support for Notion feeds; document OCR
  • 🎂August 2023
    • August 17: Prepare for usage-based billing; append SAS tokens to URIs
    • August 9: Support direct text, Markdown and HTML ingestion; new Specification LLM strategy
    • August 3: New data model for Observations, new Category entity
  • 🎇July 2023
    • July 15: Support for SharePoint feeds, new Conversation features
  • Graphlit Platform
    • Data API Changelog
Powered by GitBook
On this page
  • New Features
  • Bugs Fixed
  1. February 2024

February 21: Support for OneDrive and Google Drive feeds, extract images from PDFs, bug fixes

PreviousMarch 10: Support for Claude 3, Mistral and Groq models, usage/credits telemetry, bug fixesNextFebruary 2: Support for Semantic Alerts, OpenAI 0125 models, performance enhancements, bug fixes

Last updated 1 year ago

New Features

  • Graphlit now supports OneDrive and Google Drive feeds. Files can be ingested from OneDrive or Google Drive, including shared drives where the authenticated user has access. Both OneDrive and Google Drive support the reading of existing files, and tracking new files added to storage with recurrent feeds.

  • Graphlit now supports email backup files, such as EML or MSG, which will be assigned the EMAIL file type. During email file preparation, we will automatically extract and ingest any file attachments.

  • Graphlit now automatically extracts embedded images in PDF files, ingests them as content objects, and links them as children of the parent PDF.

  • Graphlit now supports recursive Notion feeds. When the isRecursive flag is true in the Notion feed properties, we will crawl child pages and databases, and recursively ingest them in addition to the specified pages and databases.

  • Added support for assigning collections to content ingested with the ingestPage, ingestFile or ingestText mutations. This saves a step where the content will automatically be added to the collection(s) without requiring another mutation call.

  • Added support for the CODE file type for a wide variety of source code formats, i.e. Python .py, Javascript .js. Code files use optimized text splitting for enhanced search and retrieval.

  • Added support for customGuidance in Specification object, which can be used for injecting a guidance prompt during the RAG process. For example, you can instruct the LLM to return a default response string if no content sources are found via semantic search.

  • Added tenants field to Project object, which returns a list of all tenant IDs which have been used to create an entity in Graphlit.

  • Added email metadata, separate from document metadata. Now emails will contain indexed metadata such as to, from, or subject.

  • The contents field for content objects has been replaced with children and parent fields. For example, when a ZIP file is unpacked, the unpacked files will be added as children of the ZIP file, and the ZIP file will be the parent of each of the unpacked files.

  • Removed enableImageAnalysis field from image preparation properties in workflow object. Now is enabled by default.

  • Moved disableSmartCapture field to preparation workflow stage from page preparation properties. This is used to disable the use of headless Chrome browser to capture HTML from web pages. It is enabled by default, and if disabled, Graphlit will simply download the HTML from the web page rather than rendering on headless Chrome browser.

Bugs Fixed

  • GPLA-2099: Failed to ingest ArXiV PDF. Fixed PDF parsing error.

  • GPLA-2174: LLM response is incorrect with conversation history, but no content sources.

  • GPLA-2199: ZIP package left in Indexed state after content workflow.

🌧️
💡
💡
💡
💡
⚡
⚡
⚡