Graphlit Changelog
DocumentationDeveloper PortalMore Information
  • 🐰April 2025
    • April 26: Support for OpenAI image generation, GPT-4.1, o3 and o4-mini models, bug fixes
    • April 13: Support for memory, email thread collections, Groq Llama 4 models, bug fixes
  • 🍀March 2025
    • March 27: Support for Twitter/X feed, Gemini 2.5 Pro model
    • March 15: Support for Podscan search, image similarity search, 'exists' and 'upsert' mutations
    • March 13: Support for classification workflow, notifications, Cohere Command A model, bug fixes
    • March 6: Support for MCP Server, Mistral OCR, retrieveSources, GPT-4.5, Sonnet 3.7, bug fixes
  • 💌February 2025
    • February 16: Support for Trello feed, Assembly.AI audio transcription, OpenAI o3-mini, bug fixes
  • 🎆January 2025
    • January 30: Support for Uppy file uploader, Deepseek Reasoner model, bug fixes
    • January 19: Support for at-cost LLM token pricing, multi-tenant feed deletion, bug fixes
    • January 10: Support for conversation message images, email filtering, Diffbot API key, bug fixes
    • January 4: Support for askGraphlit mutation, storage policies, bug fixes
  • 🎄December 2024
    • December 27: Support for LLM fallbacks, native Google Docs formats, website unblocking, bug fixes
    • December 22: Support for Dropbox, Box, Intercom and Zendesk feeds, OpenAI o1, Gemini 2.0, bug fixes
    • December 9: Support for website mapping, web page screenshots, Groq Llama 3.3 model, bug fixes
    • December 1: Support for retrieval-only RAG pipeline, bug fixes
  • 🦃November 2024
    • November 24: Support for direct LLM prompt, multi-turn image analysis, bug fixes
    • November 16: Support for image description, multi-turn text summarization
    • November 10: Support for web search, multi-turn content summarization, Deepgram language detection
    • November 4: Support for Anthropic Claude 3.5 Haiku, bug fixes
  • 🎃October 2024
    • October 31: Support for simulated tool calling, bug fixes
    • October 22: Support for latest Anthropic Sonnet 3.5 model, Cohere image embeddings
    • October 21: Support OpenAI, Cohere, Jina, Mistral, Voyage and Google AI embedding models
    • October 9: Support for GitHub repository feeds, bug fixes
    • October 7: Support for Anthropic and Gemini tool calling
    • October 3: Support tool calling, ingestBatch mutation, Gemini Flash 1.5 8b, bug fixes
  • 🎒September 2024
    • September 30: Support for Azure AI Inference models, Mistral Pixtral and latest Google Gemini models
    • September 26: Support for Google AI and Cerebras models, and latest Groq models
    • September 3: Support for web search feeds, model deprecations
    • September 1: Support for FHIR enrichment, latest Cohere models, bug fixes
  • 🎂August 2024
    • August 20: Support for medical entities, Anthropic prompt caching, bug fixes
    • August 11: Support for Azure AI Document Intelligence by default, language-aware summaries
    • August 8: Support for LLM-based document extraction, .NET SDK, bug fixes
  • ☀️July 2024
    • July 28: Support for indexing workflow stage, Azure AI language detection, bug fixes
    • July 25: Support for Mistral Large 2 & Nemo, Groq Llama 3.1 models, bug fixes
    • July 19: Support for OpenAI GPT-4o Mini, BYO-key for Azure AI, similarity by summary, bug fixes
    • July 4: Support for webhook Alerts, keywords summarization, Deepseek 128k context window, bug fixes
  • 🎓June 2024
    • June 21: Support for the Claude 3.5 Sonnet model, knowledge graph semantic search, and bug fixes
    • June 9: Support for Deepseek models, JSON-LD webpage parsing, performance improvements and bug fixes
  • 💐May 2024
    • May 15: Support for GraphRAG, OpenAI GPT-4o model, performance improvements and bug fixes
    • May 5: Support for Jina and Pongo rerankers, Microsoft Teams feed, new YouTube downloader, bug fixes
  • 🐇April 2024
    • April 23: Support for Python and TypeScript SDKs, latest OpenAI, Cohere & Groq models, bug fixes
    • April 7: Support for Discord feeds, Cohere reranking, section-aware chunking and retrieval
  • 🍀March 2024
    • March 23: Support for Linear, GitHub Issues and Jira issue feeds, ingest files via Web feed sitemap
    • March 13: Support for Claude 3 Haiku model, direct ingestion of Base64 encoded files
    • March 10: Support for Claude 3, Mistral and Groq models, usage/credits telemetry, bug fixes
  • 🌧️February 2024
    • February 21: Support for OneDrive and Google Drive feeds, extract images from PDFs, bug fixes
    • February 2: Support for Semantic Alerts, OpenAI 0125 models, performance enhancements, bug fixes
  • 🎆January 2024
    • January 22: Support for Google and Microsoft email feeds, reingest content in-place, bug fixes
    • January 18: Support for content publishing, LLM tools, CLIP image embeddings, bug fixes
  • 🎄December 2023
    • December 10: Support for OpenAI GPT-4 Turbo, Llama 2 and Mistral models; query by example, bug fixes
  • 🎃October 2023
    • October 30: Optimized conversation responses; added observable aliases; bug fixes
    • October 15: Support for Anthropic Claude models, Slack feeds and entity enrichment
  • 🛠️September 2023
    • September 24: Support for YouTube feeds; added documentation; bug fixes
    • September 20: Paid subscription plans; support for custom observed entities & Azure OpenAI GPT-4
    • September 4: Workflow configuration; support for Notion feeds; document OCR
  • 🎂August 2023
    • August 17: Prepare for usage-based billing; append SAS tokens to URIs
    • August 9: Support direct text, Markdown and HTML ingestion; new Specification LLM strategy
    • August 3: New data model for Observations, new Category entity
  • 🎇July 2023
    • July 15: Support for SharePoint feeds, new Conversation features
  • Graphlit Platform
    • Data API Changelog
Powered by GitBook
On this page
  • New Features
  • Bugs Fixed
  1. August 2023

August 3: New data model for Observations, new Category entity

PreviousAugust 9: Support direct text, Markdown and HTML ingestion; new Specification LLM strategyNextJuly 15: Support for SharePoint feeds, new Conversation features

Last updated 1 year ago

New Features

  • Revised data model for Observations, Occurrences and observables (i.e. Person, Organization). Now after entity extraction, content will have one Observation for each observed entity, and a list of occurrences. Occurrence now supports text, time and image occurrence types. (Text: page index, time: start/end timestamp, image: bounding box) Observations now have ObservableType and Observable fields, which specify the observed entity type and entity reference.

  • Added Category entity to GraphQL data model, which supports categories such as Phone Number or Credit Card Number.

  • Added probability field to model properties, for the LLM's token probability. (See for more detail.)

  • Added error field to feeds. If a feed fails to read from the data source, and is marked as ERRORED state, the error field will have the error description.

  • Support reingestion of changed files from feeds. For feeds, such as SharePoint or Web, where we can recognize that a file or page was updated, we will now reingest the content in-place. Content will keep the same ID, and will restart the content workflow by re-downloading the updated content from the data source. Existing observations will be deleted, and new observations will be created from the updated content.

  • Ingestion of content is now idempotent, meaning if you ingest content again from the same URI, we will reingest the content in-place, while keeping the same ID. (If we can recognize the content has not changed, such as by ETag, we will return the existing content object.)

  • Changed GraphQL data type of SharePoint tenantId, libraryId and siteId to ID rather than String.

  • Performance optimization of entity extraction, and the creation of observations.

Bugs Fixed

  • GPLA-1130: Only was extracting text from first column of PDF tables.

  • GPLA-1140: Text from DOCX tables was not extracted properly.

  • GPLA-1154: Audio content ingested from RSS feed was not deleted when feed was deleted.

🎂
💡
💡
ℹ️
ℹ️
✨
PII
OpenAI documentation