Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Who's using Vector in production? #790

Open
binarylogic opened this issue Aug 23, 2019 · 48 comments
Open

Who's using Vector in production? #790

binarylogic opened this issue Aug 23, 2019 · 48 comments
Labels
type: announcement A general announcement (pinned issues).

Comments

@binarylogic
Copy link
Contributor

binarylogic commented Aug 23, 2019

Using Vector in production? Let us know in the comments!

PSA

We're looking for companies to work closely with to ensure Vector solves production use cases. If interested please email us at vector@datadoghq.com.

@binarylogic binarylogic pinned this issue Aug 23, 2019
@vectordotdev vectordotdev locked as off-topic and limited conversation to collaborators Aug 23, 2019
@vectordotdev vectordotdev unlocked this conversation Sep 6, 2019
@zcapper
Copy link

zcapper commented Oct 17, 2019

At Checkbox (https://checkbox.ai) we use Vector to ship our system and container logs to S3 and Datadog Logs.

@wowi42
Copy link

wowi42 commented Nov 11, 2019

At Kalvad, we use Vector in production for 3 customers already, sending it to OpenDistro.

Very happy with it

Edit: we switched it to some HTTP logs system based on Elixir and warp10

Edit 2 (2023/04/04): We moved to quickwit with Kafka, it is very stable, and amazingly cheap

@binarylogic binarylogic unpinned this issue Feb 17, 2020
@binarylogic binarylogic added type: announcement A general announcement (pinned issues). and removed meta: announcement labels Aug 6, 2020
@jszwedko jszwedko reopened this Aug 12, 2020
@jszwedko jszwedko pinned this issue Aug 12, 2020
@raghu999
Copy link

At Comcast we are using vector in production for 4 teams with one team handling close to 8TB of ingest/day using vector. We are currently shipping all our logs to Elasticsearch

@suvl
Copy link

suvl commented Sep 17, 2020

At NOS we'll be shipping all our logs from home devices to Kafka using Vector. Project's due to get into production next week.

@andreyvital
Copy link

At Fundamentei—a site focused on providing Stock market financial information for Brazilian investors—we'll be sending system and container logs to Papertrail/S3.

@vectordotdev vectordotdev deleted a comment from isaidashev Sep 19, 2020
@JoaaoVerona
Copy link

At Skiley — a platform that provides an improved experience to users of music streaming services —, I started using Vector (replacing Logstash) to forward logs from journald, gathered from multiple services, to Elasticsearch and S3. It has been a joy, and congratulations for the excellent docs!

@gburd
Copy link

gburd commented Oct 9, 2020

At BlockFi - BlockFi’s vision is to bridge the worlds of traditional finance and blockchain technology to bring financial empowerment to clients on a global scale. - we use Vector in production to ship logs generated by the host (file, journald, etc.) as well as from within containers. We plan to use it for CloudWatch and CloudTrail logs as well and maybe someday for metrics (we use Telegraf today). We ship to Humio for log/metric aggregation/storage/search/dashboards/alerting/etc. This setup replaced Papertail and Prometheus+Grafana.

@vectordotdev vectordotdev deleted a comment from itkovian Oct 19, 2020
@windmgc
Copy link

windmgc commented Nov 6, 2020

At Douban - Douban is a Chinese social networking service website that allows registered users to record information and create content related to film, books, music, recent events, and activities in Chinese cities.

We use Vector in production to collect Terabytes of logs(weblog, MySQL logs, etc) per day and forward them to Kafka and ElasticSearch. And we're also using Vector now to send some web server metrics to Statsd. Vector has proved to be robust and efficient in many cases 👍

@jeromegn
Copy link

Fly.io - App hosting platform running firecracker VMs at the edge.

We use Vector in production to:

  • Transform and send our journald logs to our elasticsearch cluster
  • Capture and transform customer's apps' logs via a unix socket sink (another program sends logs there since Vector doesn't work with named pipes)

We'd love to use it even more! We're looking to replace telegraf and be able to tail from named pipes.

@josegonzalez
Copy link
Contributor

Just here to drop a note that the next version of the Dokku OSS PaaS will include a Vector integration for log shipping.

We chose Vector over other tools for a few reasons:

  • There are a number of integrations available to end users, and as we don't control where Dokku is installed, assuming an installation of a particular solution wasn't going to cut it. We require a flexible solution that continues on with the "batteries included but removable" idea that Dokku was built on, and Vector does this quite handily.
  • Configuring global and app-specific sinks in json is fairly easy, and we managed to distill it to more or less a DSN value. Would be great to have this direct in the core, but the code to support this is easy enough for us to maintain.
  • We needed a tool that integrates directly with Docker. While Dokku supports alternative schedulers such as Kubernetes, most users of alternative schedulers will have other tools to manage logs in their system, and thus we focused on the 80% use case of Docker Local scheduling. In our initial research, this excluded tools such as Filebeat that don't have easy ways to target sinks at specific docker container labels.
  • logspout - the frontrunner from gliderlabs - is fairly unmaintained for a variety of reasons (mostly time). It has some neat features that separate it from vector, but it is better for us to hitch ourselves to well-maintained solutions vs stick our heads in the ground and pretend everything under gliderlabs is the be-all and end-all solution.
  • @binarylogic once sat through an entire dinner with me - and paid for it! - while I berated him about how log shipping was a hard problem and he couldn't do it, so now here I am eating my hat.

Usage docs are here for anyone interesed: http://dokku.viewdocs.io/dokku/deployment/logs/#vector-logging-shipping

@vectordotdev vectordotdev deleted a comment from Shogobg Jan 15, 2021
@vectordotdev vectordotdev deleted a comment from Shogobg Jan 16, 2021
@otisg
Copy link

otisg commented Jan 18, 2021

Sematext now makes use of Vector in Logs Discovery.

@cetanu
Copy link

cetanu commented Feb 2, 2021

Our team within Atlassian began using Vector in production a week or so ago.
We saw some reductions in CPU/Memory usage compared to our old logging agent (fluentd), which is nice.
The main thing we like is being able to perform unit tests on our configuration.
Looking forward to a good WASM interface; we'd like to replace some lua with Rust if possible (Update; VRL suits all our needs especially with Vector 0.22!).

As for volumes, I can't give an exact number (Update: around 16 TB per day), but we're processing most of the traffic at the edge of the Atlassian cloud network... So it's a fair bit. Easily billions of events per hour.
If you use an Atlassian product and the response has a Server header with a value globaledge-envoy, it was logged by Vector 🥳

@KannarFr
Copy link
Contributor

KannarFr commented Feb 9, 2021

Clever Cloud is running Vector on each VM for logs and metrics collection.

@prognant prognant unpinned this issue Apr 9, 2021
@prognant prognant pinned this issue Apr 9, 2021
@jothoma1
Copy link

jothoma1 commented Jun 21, 2021

SIB is using Vector in production for public schools in France. Each school has it's own vector instance for gathering logs then it sends to a central Vector which ships to Elasticsearch and Ceph S3. We were using flowgger before and are really happy with Vector

@ypid-geberit
Copy link
Contributor

ypid-geberit commented Oct 4, 2021

Geberit is using Vector.

I started seriously testing Vector 2021-03. Before that month, essential features were missing. As I had built a log collection pipeline with Logstash for a decent number of log types, I had some ideas how to design this with Vector. So after a POC, I designed a framework for Vector config. I waited before posting here until I had published the framework. This day is today, finally ;-)

You might find the event-processing-framework useful. From what I can tell, it is the first of a kind for Vector. Note that it is somewhat opinionated. I make heavy use of the Elastic Common Schema (ECS) and YAML instead of TOML.

Vector is awesome, keep up the work!

Cc: @aswath86

@prognant prognant unpinned this issue Nov 9, 2021
@prognant prognant pinned this issue Nov 9, 2021
@haimhm
Copy link

haimhm commented Nov 16, 2021

We are at FINAL use Vector A LOT, in several ways:

  • Vector -> Kafka
  • Kafka -> Vector
  • Vector -> Loki
  • Vector -> Main storage

We'll be happy to collaborate with you in order to solve production issues.

@prognant prognant unpinned this issue Dec 10, 2021
@jszwedko jszwedko pinned this issue Mar 11, 2022
@arthurzinck
Copy link

At BedrockStreaming we are using it in production.
As we freshly use it, we use it only to send logs to S3 for now.
I am happy to help fix production issue

@pm5
Copy link

pm5 commented Apr 21, 2022

At ProtonMail and ProtonVPN we are using Vector to connect Kafka->ClickHouse for one of the anomaly-detection systems.

@gschier
Copy link

gschier commented Apr 25, 2022

Railway is using Vector! We use it to send our 20k+ deployment logs to both GCP Storage (for querying) and the filesystem (for streaming). I wrote a post about our general architecture when we first adopted it: https://blog.railway.app/p/building-logs-v2

@darjeeling
Copy link

LINE Corp is using vector in production to deliver tons of logs and metrics !

@jblang
Copy link

jblang commented Jun 24, 2022

DataStax has just rolled out Vector as part of our production logging stack for our Astra DBaaS.

@mari-arondeus
Copy link

UWG has been using Vector to ingest network switch logs for our 3 campuses for over a year. Vector works in tandem with Grafana Loki & Mimir as well as a MinIO cluster, all running on Docker Swarm Mode, to monitor 160 network switches and over 20,000 networked devices.

Vector, in particular, has been absolutely invaluable to us as a sort of "glue" for patching together frustratingly non-standard or otherwise proprietary syslog outputs and formatting them for aggregation and long-term indexed storage.

While our students may not see the work Vector is doing for our public institution on the backend, I'm very proud of what it's allowed us to accomplish on a a budget!

@splitice
Copy link
Contributor

X4B has been using Vector to ingest and process many logs for over a year. Vector (currently 0.22.x) communicates with our Loki stack as the connector between remote edge systems and the central logging system.

@tshepang

This comment was marked as off-topic.

@bencooper222
Copy link

bencooper222 commented Aug 23, 2022

@convoyinc we use it for our logging pipeline from K8s to ES, S3 and Datadog Metrics :)

@limick
Copy link

limick commented Sep 13, 2022

Upsolver uses Vector internally to ingest metrics into ClickHouse. It's very stable, performant and has an amazing range of features for a relatively young project.

@Sh4d1
Copy link
Contributor

Sh4d1 commented Sep 22, 2022

Scaleway uses Vector to collect, transform, and send all the logs of the Scaleway S3 platform! Many thanks again for this product and the community behind, very kind and reactive ❤️

@shurup
Copy link

shurup commented Dec 9, 2022

Deckhouse Kubernetes Platform uses Vector in its log-shipper module. It means that Flant alone maintains hundreds of K8s clusters for various customers that rely on Vector to ship their apps' logs.

@massyah
Copy link

massyah commented Jan 7, 2023

Displayce uses Vector to send, process and route all the logs to a self hosted Loki instance. Thanks for this product!

@protochron
Copy link
Contributor

Cosmonic uses Vector to ship logs and metrics to various systems. We're currently using the NATS, Prometheus, ElasticSearch and Clickhouse integrations among others.

@xunleii
Copy link
Contributor

xunleii commented Mar 30, 2023

At RadioFrance - the French national public radio broadcaster - we are shipping all logs via Vector for over 6 months in production (mainly Kubernetes and some generated by hosts and our CDN, ~1.2TB per day).
Many thanks for this product !

@danielhoherd
Copy link

danielhoherd commented Apr 20, 2023

I work at Astronomer.io (https://github.com/astronomer) and when we implemented sidecar logging for airflow in kubernetes, we chose vector over fluentd, fluentbit, etc.. We first started using it exactly a year ago.

@honganan
Copy link
Contributor

honganan commented Aug 4, 2023

We Tuya Smart are using vector to collect and transform logs. Especially transformation saves half of resources than our old used tool, and the vrl remap is really good to use. On the whole, It's amazing!

@karuppiah7890
Copy link

We at Togai use Vector in production to collect logs

@mr-karan
Copy link

mr-karan commented Aug 17, 2023

We at Zerodha are happy users of Vector. Have written about our setup here: https://zerodha.tech/blog/logging-at-zerodha/

@ShivanshMathurJuspay
Copy link

We at Hyperswitch are using Vector for a lot of use-cases including pushing logs from Envoy/Squid to logging service , Ingestion pipeline , etc.

@ncalibey
Copy link

At Timescale we use Vector shipping logs to customer and scraping k8s, cgroup, and storage data!

@spnngl
Copy link

spnngl commented Sep 6, 2024

At Wiremind we use vector for collecting logs on multiple kubernetes clusters.

@nzxwang
Copy link

nzxwang commented Sep 11, 2024

At Riot Games, we use vector to build an observability pipeline for collecting, processing and forwarding (to Datadog) almost of all our company's telemetry. We also love the added benefit of having out of the box monitoring for our observability pipeline itself, and tooling like vector tap.

@ramseymcgrathfd
Copy link

ramseymcgrathfd commented Nov 12, 2024

At Fanduel we run Vector:

  • As a parser to enrich route53 logs with their resolver endpoints (s3 integration pulling from a ram shared query log)
  • Intake and processor for audit logs from a couple services that have no other datadog integration (sidecar pod that sends logs to an http listener)
  • As a generic tcp endpoint for network device logs to pre-process logs into their integration's format (network listener behind an nlb)
  • The main intake for our kubernetes-based datadog agent logs/metrics/trace to properly tag services (and filter out some unneeded ones) with an NLB behind a vpc endpoint that we share to our eks VPCs.
    We can push close to 1M events/sec ingested

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: announcement A general announcement (pinned issues).
Projects
None yet
Development

No branches or pull requests