THIS FORUM IS NO LONGER ACTIVE. Join us on CNCF Slack: https://cloud-native.slack.com/archives/CJFCJHG4Q.
Hi. Could somebody advise on how to set up “trace ID aware load balancing”? I’m trying to set up an OTel collector cluster that samples successful traces but keeps every trace with at least one erroneous span.
I see this note in the tail sampling processor repo: “Technically, trace ID aware load balancing could be used to support multiple collector instances, but this configuration has not been tested.”
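For anyone unsure what "trace ID aware" means here: tail sampling can only decide on a whole trace if every span of that trace reaches the same collector instance, so the balancer has to pick the backend from the trace ID rather than round-robin. A minimal sketch of the idea in Python follows. It is not collector code; the backend addresses, the span dicts, and the function names are all made up for illustration.

```python
import hashlib
from collections import defaultdict
from typing import Dict, List, Sequence


def pick_collector(trace_id: str, collectors: Sequence[str]) -> str:
    """Map a trace ID to one collector so all spans of a trace stay together."""
    if not collectors:
        raise ValueError("at least one collector is required")
    # Hash the trace ID and reduce it to an index into the backend list.
    digest = hashlib.sha256(trace_id.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(collectors)
    return collectors[index]


def route(spans: List[dict], collectors: Sequence[str]) -> Dict[str, List[dict]]:
    """Group spans by the collector chosen for their trace ID."""
    batches: Dict[str, List[dict]] = defaultdict(list)
    for span in spans:
        batches[pick_collector(span["trace_id"], collectors)].append(span)
    return dict(batches)


# Hypothetical backends and spans, for illustration only.
COLLECTORS = ["collector-0:4317", "collector-1:4317", "collector-2:4317"]
SPANS = [
    {"trace_id": "4bf92f3577b34da6a3ce929d0e0e4736", "name": "GET /owners", "error": False},
    {"trace_id": "4bf92f3577b34da6a3ce929d0e0e4736", "name": "SELECT owners", "error": True},
    {"trace_id": "0af7651916cd43dd8448eb211c80319c", "name": "GET /vets", "error": False},
]
ROUTED = route(SPANS, COLLECTORS)
```

Plain modulo hashing moves most traces to a different backend whenever the number of collectors changes, so a real setup would probably want consistent hashing instead.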
I keep reading that it is recommended to run the collector as an agent on the host container, and that we can additionally run the collector as a gateway service. Does that mean running the collector as an agent is a must and running it as a gateway service is optional?
Design doc. IMO, if you already have old collector agents in your system, such as Jaeger agents, running the collector as a standalone service is probably better, since you don't need to change every application.
If you are introducing OpenTelemetry for the first time, running the collector as an agent may be better, since applications can report data to the nearest collector.
Hey all, looking for some advice on how to structure spans that we have implemented at the transport layer as opposed to the application layer. We have a TCP load balancer in our edge network that manages incoming connections/certs. We recently implemented 3 different spans: one for the connection establishment (a parent span), another for fetching the cert (child), and finally one for how long it takes to proxy to the corresponding backend service (child).
Given the connection establishment span is at the TCP layer, this makes it difficult to extract context from a client side span since we don't have access to HTTP headers (at least in that moment). Ideally, our client side spans would be the parent of the connection establishment span.
I'm wondering if this is possible, or if our spans should be set up differently. There aren't too many examples online of running distributed traces below the application layer.
Hi all, I'm trying to dynamically turn traces on/off at the OpenTelemetry agent level. I used -Dotel.traces.sampler:
java -javaagent:opentelemetry-javaagent-all.jar \
-Dotel.traces.exporter=otlp \
-Dotel.exporter.otlp.endpoint=http://localhost:4317 \
-Dotel.otlp.span.timeout=4000 \
-Dotel.resource.attributes=service.name=pet-clinic \
-Dotel.traces.sampler="always_off" \
-jar target/spring-petclinic-2.4.5.jar
But the problem is that when I need to turn traces on (-Dotel.traces.sampler="always_on"), I have to stop the process and rerun the command with the changed property, which means restarting my application too. Is there any alternative way to turn traces on/off without restarting the application?
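Not an answer for the Java agent specifically, but the general pattern behind "toggle without restart" is a sampler that reads a flag at decision time instead of at startup. A rough sketch of that pattern in Python is below. It only illustrates the idea; `ToggleSampler` and its methods are hypothetical names, not part of any OpenTelemetry API, and I am not claiming the agent exposes such a hook.

```python
import threading


class ToggleSampler:
    """Hypothetical sampler whose decision can be flipped while the process runs."""

    def __init__(self, enabled: bool = False) -> None:
        self._enabled = enabled
        self._lock = threading.Lock()

    def set_enabled(self, enabled: bool) -> None:
        # Could be called from an admin endpoint, a signal handler, or a config watcher.
        with self._lock:
            self._enabled = enabled

    def should_sample(self, trace_id: str) -> bool:
        # Behaves like always_on or always_off depending on the current flag.
        with self._lock:
            return self._enabled


sampler = ToggleSampler(enabled=False)   # starts like always_off
before = sampler.should_sample("trace-1")
sampler.set_enabled(True)                # flipped at runtime, no restart
after = sampler.should_sample("trace-2")
```

The trade-off is that something in the process has to own the flag and expose a way to change it, which a startup-only system property does not give you.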
Hello team, I'm wondering how to make use of Link in a Grafana query.
The current description of Link is:
A Span may be linked to zero or more other Spans (defined by SpanContext) that are causally related. Links can point to Spans inside a single Trace or across different Traces. Links can be used to represent batched operations where a Span was initiated by multiple initiating Spans, each representing a single incoming item being processed in the batch.
Once we attach a Link to a span, how do we make use of it in a Grafana query?
Suppose I attach a Link from spanA (in traceA) to spanB (in traceB). Can I get the result of traceB when querying traceA in Grafana?
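To make the data model concrete: per the description quoted above, a Link carries only the SpanContext of the other span, so traceA holds a pointer to traceB rather than a copy of its spans. The sketch below models that with plain Python dataclasses. These are illustrative stand-ins, not the OpenTelemetry SDK classes, and the IDs are made up. Whether Grafana follows such a pointer depends on the data source, which I can't confirm here.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass(frozen=True)
class SpanContext:
    """Illustrative stand-in: identifies a span by trace ID and span ID."""
    trace_id: str
    span_id: str


@dataclass
class Span:
    context: SpanContext
    name: str
    links: List[SpanContext] = field(default_factory=list)


def linked_trace_ids(span: Span) -> Set[str]:
    """Return the IDs of other traces this span links to."""
    return {
        link.trace_id
        for link in span.links
        if link.trace_id != span.context.trace_id
    }


# spanB lives in traceB; spanA lives in traceA and links to spanB.
span_b = Span(SpanContext("traceB", "b1"), "enqueue item")
span_a = Span(SpanContext("traceA", "a1"), "process batch", links=[span_b.context])

# Looking at traceA yields the ID of traceB, which then needs its own lookup.
follow_up = linked_trace_ids(span_a)
```

So in this model, getting traceB "when querying traceA" is a second query keyed by the linked trace ID.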
Hello team, we have an issue with our logzio exporter: users get errors when using it from the opentelemetry-collector-contrib image version >= 0.24:
otel-agent | Error: cannot build pipelines: cannot build builtExporters: error creating logzio exporter: mkdir /tmp: permission denied
otel-agent | 2021/06/07 10:20:51 application run finished with error: cannot build pipelines: cannot build builtExporters: error creating logzio exporter: mkdir /tmp: permission denied
The exporter works fine with version <=0.23.
Does anyone know what could cause those errors from version 0.24? And what can we do to solve it?
Hey guys, when I submitted PR #1973 and ran the workflows, something went wrong with test-coverage:
{'detail': ErrorDetail(string='Unable to locate build via Github Actions API. Please upload with the Codecov repository upload token to resolve issue.', code='not_found')}
How should I solve this problem?
$ cat /etc/otelcol/otelcol.conf
# Systemd environment file for the otelcol service
# Command-line options for the otelcol service.
# Run `/usr/bin/otelcol --help` to see all available options.
OTELCOL_OPTIONS="--config=/etc/otelcol/config.yaml"
$ cat /etc/otelcol/config.yaml
extensions:
  health_check:
  pprof:
    endpoint: 0.0.0.0:1777
  zpages:
    endpoint: 0.0.0.0:55679
receivers:
  otlp:
    protocols:
      grpc:
      http:
  opencensus:
  # Collect own metrics
  prometheus:
    config:
      scrape_configs:
      - job_name: 'otel-collector'
        scrape_interval: 10s
        static_configs:
        - targets: ['0.0.0.0:8888']
  jaeger:
    protocols:
      grpc:
      thrift_binary:
      thrift_compact:
      thrift_http:
  zipkin:
processors:
  batch:
exporters:
  logging:
    logLevel: debug
service:
  pipelines:
    traces:
      receivers: [otlp, opencensus, jaeger, zipkin]
      processors: [batch]
      exporters: [logging]
    metrics:
      receivers: [otlp, opencensus, prometheus]
      processors: [batch]
      exporters: [logging]
  extensions: [health_check, pprof, zpages]
$ sudo dpkg -i otelcol_0.44.0_linux_amd64.deb
Selecting previously unselected package otelcol.
(Reading database ... 347245 files and directories currently installed.)
Preparing to unpack otelcol_0.44.0_linux_amd64.deb ...
Unpacking otelcol (0.44.0) ...
Setting up otelcol (0.44.0) ...
Created symlink /etc/systemd/system/multi-user.target.wants/otelcol.service → /lib/systemd/system/otelcol.service.
$ systemctl restart otelcol.service
$ journalctl -u otelcol.service
-- Logs begin at Thu 2022-02-17 13:45:25 IST, end at Thu 2022-02-24 11:28:02 IST. --
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to load environment files: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to run 'start' task: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:41 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Scheduled restart job, restart counter is at 1.
Feb 19 18:00:41 voyager2 systemd[1]: Stopped OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to load environment files: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to run 'start' task: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:41 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Scheduled restart job, restart counter is at 2.
Feb 19 18:00:41 voyager2 systemd[1]: Stopped OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to load environment files: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to run 'start' task: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:41 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Scheduled restart job, restart counter is at 3.
Feb 19 18:00:41 voyager2 systemd[1]: Stopped OpenTelemetry Collector.
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to load environment files: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed to run 'start' task: No such file or directory
Feb 19 18:00:41 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:41 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Scheduled restart job, restart counter is at 4.
Feb 19 18:00:42 voyager2 systemd[1]: Stopped OpenTelemetry Collector.
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Failed to load environment files: No such file or directory
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Failed to run 'start' task: No such file or directory
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:42 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Scheduled restart job, restart counter is at 5.
Feb 19 18:00:42 voyager2 systemd[1]: Stopped OpenTelemetry Collector.
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Start request repeated too quickly.
Feb 19 18:00:42 voyager2 systemd[1]: otelcol.service: Failed with result 'resources'.
Feb 19 18:00:42 voyager2 systemd[1]: Failed to start OpenTelemetry Collector.
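Two things that might help narrow this down. First, the entries shown are all from Feb 19 while the journal runs to Feb 24, so they may predate the install and it is worth checking the newest entries. Second, "Failed to load environment files" means systemd could not read a path named in an `EnvironmentFile=` line of the unit. A small Python sketch for listing those paths and flagging missing ones is below. The sample unit text is hypothetical (I have not seen the real /lib/systemd/system/otelcol.service), and the helper names are mine.

```python
import os
from typing import Callable, List, Tuple


def environment_files(unit_text: str) -> List[Tuple[str, bool]]:
    """Return (path, optional) for each EnvironmentFile= line in a unit file."""
    found: List[Tuple[str, bool]] = []
    for raw in unit_text.splitlines():
        line = raw.strip()
        if not line.startswith("EnvironmentFile="):
            continue
        value = line[len("EnvironmentFile="):].strip()
        # A leading "-" tells systemd to ignore the file if it does not exist.
        optional = value.startswith("-")
        path = value[1:] if optional else value
        if path:
            found.append((path, optional))
    return found


def missing_required(
    unit_text: str, exists: Callable[[str], bool] = os.path.exists
) -> List[str]:
    """Return required environment files that are not present on disk."""
    return [
        path
        for path, optional in environment_files(unit_text)
        if not optional and not exists(path)
    ]


# Hypothetical unit content, for illustration only.
SAMPLE_UNIT = """\
[Service]
EnvironmentFile=/etc/otelcol/otelcol.conf
ExecStart=/usr/bin/otelcol $OTELCOL_OPTIONS
"""
PATHS = environment_files(SAMPLE_UNIT)
```

To run it against the installed unit, read the file from the path shown in the dpkg output above and pass its text to `missing_required`.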