Free open source OSINT platform for gathering, ingesting, and analyzing open-source intelligence data

12 stars · Python

Alexandria, open source news aggregation and classification suite

by RicDan·Mar 26, 2026·6 points·0 comments

AI Analysis

Mid · Niche Gem · Ship It

Functional pipeline with Neo4j graphs, but the training system and GPU support aren't built yet.

Strengths
  • Modular Docker compose architecture with RabbitMQ enables flexible pipeline configuration.
  • Custom label and relation schema lets users define their own NLP classification targets.
  • Neo4j graph database visualizes entity relationships with temporal decay controls.
Weaknesses
  • Author admits training pipeline doesn't exist yet — only manual labelling is implemented.
  • NLP runs CPU-only with no GPU acceleration, making processing noticeably slow.
Target Audience

Researchers and journalists tracking news sources and entity relationships

Similar To

Feedly · Inoreader · Maltego

Post Description

I've always found it a bit hard to grasp how far news reaches. Since I'm also a bit behind on Python, I decided to build something from scratch.

It's really simple: Docker Compose, microservices connected by some very basic data pipelines using RabbitMQ, and some local NLP. The NLP portion is a bit slow because it runs purely on the CPU and doesn't use a GPU/NPU if you have one.
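
As a rough illustration of the pipeline shape described here (not the project's actual code), the sketch below passes JSON messages between two stages. It uses Python's in-process `queue.Queue` as a stand-in for RabbitMQ queues so it runs without a broker; the queue names, the `ingest` and `classify_worker` functions, and the `word_count` field are all hypothetical.

```python
import json
import queue

# In-process stand-ins for RabbitMQ queues; the names are hypothetical.
raw_articles = queue.Queue()
classified_articles = queue.Queue()


def ingest(article: dict) -> None:
    """Ingest stage: serialize an article and publish it for the next service."""
    raw_articles.put(json.dumps(article))


def classify_worker() -> int:
    """NLP stage: consume every pending message, annotate it, republish it."""
    handled = 0
    while not raw_articles.empty():
        article = json.loads(raw_articles.get())
        # Placeholder for the real (CPU-bound) NLP step.
        article["word_count"] = len(article["text"].split())
        classified_articles.put(json.dumps(article))
        handled += 1
    return handled


ingest({"title": "Example headline", "text": "Some article body text"})
classify_worker()
result = json.loads(classified_articles.get())
print(result["word_count"])  # prints 4
```

Because each stage only reads from one queue and writes to another, a slow stage such as the NLP step could in principle be scaled out by running more consumers without touching the other services.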

The tool lets you define your own "labels" and "relations", which the local NLP models then pick up and assign to news articles. By default, only a few are included.
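
To make the idea of user-defined labels concrete, here is a minimal sketch under stated assumptions: the `Label` schema and the `assign_labels` function are hypothetical, and simple keyword matching stands in for the local NLP models, which the post does not describe in detail.

```python
from dataclasses import dataclass, field


@dataclass
class Label:
    """A user-defined classification target (hypothetical schema)."""
    name: str
    keywords: list[str] = field(default_factory=list)


def assign_labels(text: str, labels: list[Label]) -> list[str]:
    """Return the names of labels whose keywords occur in the text.

    Keyword matching is only a placeholder for a real NLP model.
    """
    lowered = text.lower()
    return [
        label.name
        for label in labels
        if any(keyword.lower() in lowered for keyword in label.keywords)
    ]


labels = [
    Label("energy", ["pipeline", "oil", "gas"]),
    Label("elections", ["ballot", "vote"]),
]
print(assign_labels("Gas prices rise ahead of winter", labels))  # prints ['energy']
```

Relations could presumably be modeled the same way, as named records linking two entities, but that part is left out here because the post gives no detail on their format.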

But the goal is to have your own personal news aggregator, and adding or changing news sources (as long as they have an RSS feed) should be pretty easy through the Docker Compose file.
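
One way such configuration could work, sketched with the standard library only: a service reads its feed list from an environment variable (which a Compose file can set per service) and extracts items from RSS 2.0 XML. The variable name `RSS_FEEDS`, the comma-separated format, and both function names are assumptions for illustration, not the project's actual settings.

```python
import os
import xml.etree.ElementTree as ET

# A tiny inline RSS 2.0 document so the example runs without network access.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Feed</title>
    <item>
      <title>First story</title>
      <link>https://example.com/first</link>
    </item>
    <item>
      <title>Second story</title>
      <link>https://example.com/second</link>
    </item>
  </channel>
</rss>
"""


def feed_urls_from_env(var: str = "RSS_FEEDS") -> list[str]:
    """Read a comma-separated list of feed URLs from an environment variable."""
    raw = os.environ.get(var, "")
    return [url.strip() for url in raw.split(",") if url.strip()]


def parse_items(rss_xml: str) -> list[dict]:
    """Extract the title and link from each <item> of an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    return [
        {"title": item.findtext("title", ""), "link": item.findtext("link", "")}
        for item in root.iter("item")
    ]


for entry in parse_items(SAMPLE_RSS):
    print(entry["title"], entry["link"])
```

With this layout, adding a source would only mean appending a URL to the variable in the Compose file and restarting the service.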

Similar Projects

Security · ●●● · Banger

My OSINT dashboard with 60 feeds now has a pseudonymous P2P testnet

Aggregates 60 OSINT feeds into one map with pseudonymous P2P comms built in.

Rabbit Hole · Dark Horse
vancecookcobxin
50 · 1mo ago
Security · ●● · Solid

Lumina – passive OSINT recon tool for domains

Pulls together passive sources — crt.sh, Wayback, GitHub search, Shodan and Hunter — into a single HTML+JSON output so you can run recon without touching the target. It isn't reinventing OSINT, but the combination of multi-source subdomain enumeration, built-in WHOIS/JSON export and a ready-to-share dark report plus Docker support makes it an immediately useful tool for quick triage.

Niche Gem · Ship It
surfruit
20 · 3mo ago