Questions tagged [prometheus]

For questions about Prometheus, an open-source systems monitoring and alerting toolkit.

About Prometheus (from prometheus.io):

Prometheus is an open-source systems monitoring and alerting toolkit originally built at SoundCloud. Since its inception in 2012, many companies and organizations have adopted Prometheus, and the project has a very active developer and user community. It is now a standalone open source project and maintained independently of any company. To emphasize this and clarify the project's governance structure, Prometheus joined the Cloud Native Computing Foundation in 2016 as the second hosted project after Kubernetes.

93 questions
11
votes
2 answers

How to calculate disk space required by Prometheus v2.2?

We are trying to calculate the storage requirements but is unable to find the values needed to do the calculation for our version of Prometheus (v2.2). Prometheus(v2.2) storage documentation gives this simple formula: needed_disk_space =…
MCoetzee
  • 113
  • 1
  • 1
  • 7
3
votes
1 answer

Prometheus and Azure WebApps: How to monitor multiple instances with prometheus

I have an application that has two instances running and each request runs on a different machine. I am monitoring an event with a counter in my application and each instance runs it's own counter using prometheus-net. It would be great if…
2
votes
1 answer

Does Prometheus log scrape-duration?

I'd like to graph scrape duration for a particular target. The Prometheus web interface has a Status of Targets page, which tabulates the (time elapsed since) "last scrape" and corresponding "scrape duration". Does Prometheus itself publish metrics…
benjimin
  • 289
  • 1
  • 12
2
votes
1 answer

Prometheus - join series based on the key part of a label

In 1.16 k8s deprecated the label kubernetes.io/role: worker I had previously been using as a filter to target graphs to specific node role using this in my grafana variables declaration label_values(kube_node_labels, label_kubernetes_io_role) #join…
Brando___
  • 101
  • 1
  • 6
1
vote
1 answer

Add prometheus config for all jobs

In my prometheus.yml I have several jobs defined like: - job_name: alpha scheme: 'https' metrics_path: '/metrics' static_configs: - targets: - one.example - two.example relabel_configs: - source_labels:…
Vietna
  • 25
  • 5
1
vote
1 answer

How to append namespace before metric name in prometheus?

I have static config in prometheus.yml - job_name: 'myjob' static_configs: - targets: ['monitoring.api.endpoint.net'] And I recieve various metrics for example cpu_usage, disk_write_ops, etc. Is it possible to put them all into…
0
votes
1 answer

Prometheus/BlackBox Exporter probe_success 0 but service still running

I'm currently utilizing Prometheus with Blackbox Exporter for monitoring our services. However, I've encountered an issue where one of our external services is showing probe_success as 0, despite being accessible when tested with curl. All my other…
0
votes
1 answer

List all active promtail targets

I have spend a significant time reading the manual, but I still cannot figure out how to actually get a list of the targets that promtail is currently tailing. The metrics endpoint only exposes aggregate numbers. How can I list all the targets…
MrMeeSeeks
  • 11
  • 1
0
votes
1 answer

How to purge / prevent using the prometheus local storage when using remote_write

I configured Prometheus to push metrics using remote_write but the local storage is still in use and keeps growing How could Prometheus be configured to stop storing locally and only push metrics or to minimize the retention policy from the local…
nbari
  • 101
  • 1
0
votes
1 answer

Scraping different Prometheus metrics at different rates

I am running a Tendermint network. I am using Prometheus to track various metrics, and to track node health. For performance metrics such as tendermint_consensus_total_txs, I want updates every second. For health metrics such as…
Ethan Reesor
  • 103
  • 3
0
votes
1 answer

Node-centric network metrics are very different from the sum of metrics of the Pods running on that node

Context I am trying to chart the network bandwidth usage of a node in 2 different manners: By looking at global metrics for that node By summing up the corresponding metric for each Pod To achieve this, I am issuing the following Prometheus…
-1
votes
1 answer

Enterprise Ready Prometheus

I am new to Prometheus and have been reading about its popularity as a tool for monitoring applications, containers and K8s objects. However, most of the blogs that I have come across have at best done a very rudimentary introduction to using…
Gaurav Parashar
  • 187
  • 1
  • 4
-1
votes
1 answer

K8s to Prometheus apiserver_client_certificate_expiration_seconds_count metric, why does it count more seconds than 86400 for a single day?

In the 2 pictures below I am comparing the metric from K8s apiserver_client_certificate_expiration_seconds_count{job="apiserver"} for my 3 master nodes on K8s 1.14.3. A single day should have 86400 seconds, how come at the start of the day for a…