Skip to content
@Sourcery-info

Sourcery.info

Sourcery.info

Your private, secure investigative journalism AI

Introduction

Sourcery.info lets you interview a cache of documents using natural language, all within a secure environment. It helps you find answers to questions you didn't know to ask.

Large language models combined with source documents has created a potential treasure trove for investigative journalists, OSInt researchers, or anyone who needs to glean insights from a document cache. However, the process typically involves uploading the documents to third parties, which is inherently insecure and in many cases dangerous.

With Sourcery.info, you run your own large language model, process the documents locally, and query them locally. You can run it on your own network, on your PC, on a virtual PC, or even in an air gapped environment.

Sourcery.info uses an AI methodology called retrieval-augmented generation (RAG) to combine both your own unique knowledge base (the document cache) with the power of a large language model.

Apart from the ability to search through documents using natural language, the RAG-based AI method has the potential of surfacing interesting, useful information to researchers who don't know exactly what they're looking for. This can be particularly useful for large caches of document leaks, for instance.

Sourcery.info takes the complexity out of RAG by optimising the generation of embeddings (the LLM codes that turn your documents into vectors), deciding on chunk size, using strategies like small-to-big, and selecting the appropriate language model for your use case.

The user interface is simple without losing the power to inspect the documents manually. It includes features to ingest documents, create the embeddings, query the documents using natural language, and view the original sources along with the results.

It will:

  • Ingest a cache of documents;
  • OCR and extract the data as necessary;
  • Generate embeddings locally;
  • Store the embeddings in a local vector database;
  • Use local LLMs in a RAG model to query the documents using natural language;
  • Present results with the associated sources for easy reference to the sources.

It won't:

  • Send any information out of the system;
  • Load any external dependencies;
  • Censor or refuse to answer questions that could be sensitive;
  • Store any usage information, unless required for auditing purposes (with a specific "opt-in").

Current State

Sourcery.info is currently in its early development phase. It is not yet ready for beta testing, although we are processing some documents on behalf of journalists.

Popular repositories Loading

  1. sourcery-info-homepage sourcery-info-homepage Public

    HTML

  2. sourcery-frontend sourcery-frontend Public

    Your private, secure investigative AI. This is the frontend for the Sourcery.info project.

    TypeScript

  3. .github .github Public

  4. sourcery-info sourcery-info Public

    TypeScript

  5. open-webui open-webui Public

    Forked from open-webui/open-webui

    User-friendly WebUI for AI (Formerly Ollama WebUI)

    Svelte

  6. LightRAG LightRAG Public

    Forked from HKUDS/LightRAG

    "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    Python

Repositories

Showing 6 of 6 repositories
  • Sourcery-info/sourcery-info’s past year of commit activity
    TypeScript 0 0 0 0 Updated Nov 2, 2024
  • LightRAG Public Forked from HKUDS/LightRAG

    "LightRAG: Simple and Fast Retrieval-Augmented Generation"

    Sourcery-info/LightRAG’s past year of commit activity
    Python 0 MIT 810 0 0 Updated Oct 25, 2024
  • open-webui Public Forked from open-webui/open-webui

    User-friendly WebUI for AI (Formerly Ollama WebUI)

    Sourcery-info/open-webui’s past year of commit activity
    Svelte 0 MIT 5,622 0 0 Updated Sep 26, 2024
  • Sourcery-info/sourcery-info-homepage’s past year of commit activity
    HTML 0 0 0 0 Updated Sep 26, 2024
  • sourcery-frontend Public

    Your private, secure investigative AI. This is the frontend for the Sourcery.info project.

    Sourcery-info/sourcery-frontend’s past year of commit activity
    TypeScript 0 0 0 0 Updated Mar 23, 2024
  • .github Public
    Sourcery-info/.github’s past year of commit activity
    0 0 0 0 Updated Mar 19, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…