Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Repo info
Activity
Adrian Cole
@adriancole
even if I could I wouldn't be the one to troubleshoot why this query is slow, it can be slow for many reasons including not enough memory, overload, too much data in that query etc
my goal has been to get you a solid idea of the slow query, that's as much as you can expect here
underprovisioned clusters, services who do not sample at all, or who add too many tags or logs, or have variables in names can all result in slow, and 15s is really slow.
if you are sending 100% data at any cluster you can kill it
this is why ex in brave we have things like rate-limited samplers
Ucky.Kit
@uckyk
Ok,i got it, but how can I find 100% uploaded in the server
Adrian Cole
@adriancole
can you rephrase your question? I don't understand what you mean
Ucky.Kit
@uckyk
"if you are sending 100% data at any cluster you can kill it"
my question is how to find that server in my zipkin-server which sending 100%data
At present, I suspect that the IIS of ES exceeds the maximum number of connections
Adrian Cole
@adriancole
@uckyk you can look at /metrics
you can also use prometheus, but this only shows the rate received not the rate sent
if using java the zipkin-reporter-java project has micrometer integartion for sender metrics
Ucky.Kit
@uckyk
Yes, yes, I remember that thx~:D
Adrian Cole
@adriancole
np
Ucky.Kit
@uckyk
image.png
Finally, I query the log and find that it is here. If I use aggregation, I will not find the result. Throw timeout.
It's OK to delete the code in red box .
What is the problem? Isorry to ask you something other than Zipkin
Adrian Cole
@adriancole
@uckyk this aggregation is to return the ordered trace IDs that match the query. the GET /api/v2/traces endpoint won't work without it
but anyway since you've isolated this you now have a way to troubleshoot your data indepedent of zipkin
when you can do that http command without timeout, then the UI has a chance to operate. you can see if a shorter duration ever completes
ex the "from" and "to"
QUERY_LOOKBACK defaults to 1 day in millis: 86400000
Ucky.Kit
@uckyk
yes, u are right, Now the question is just about this aggregation. Can you give me your opinion on what causes it?
Adrian Cole
@adriancole
too much data to aggregate can cause something ES can never return
I don't run ES I don't know the troubleshooting tools people use to look at query planner etc
you may be able to find techniques on their forum or docs
most times when someone comes in asking they aren't even running a cluster
just one overloaded JVM for elasticsearch
Ucky.Kit
@uckyk
OK, thanks a lot
Adrian Cole
@adriancole
beyond that I don't know how people in general run elasticsearch and debug performance problems. I do know some sites have a team just for elasticsearch (unrelated to zipkin)
Ucky.Kit
@uckyk
I'll consult our operation and maintenance colleagues, which may help~
Adrian Cole
@adriancole
cool. if you learn any tricks, do pass back. next person might have question
Ucky.Kit
@uckyk
OK~~
Aditya Mukhopadhyay
@adityamukho
Hi, I am working with a "synchronous" javascript environment (modified v8 engine inside ArangoDB server). I need to use a tracer for my application that will run on this engine. Async code will not behave properly here (https://www.arangodb.com/docs/stable/foxx.html#compatibility-caveats)
They have provided stubs for setTimeout, setInterval, clearTimeout and the like, so the code will not entirely break.
I was wondering how Zipkin's js client would work in this environment (if it worked at all). Does it depend on an event loop and timers to periodically report its collected spans? Is there a way to disable this behaviour if so? ArangoDB does have a 'Tasks' API that lets you run async code (similar to a cron), but I'm not sure if I can plug in a custom reporter to use this.
Please help me understand if and how I can integrate Zipkin in this setting.'
rajat singh
@rajatsingh998
hey, I am new to this project, I just built the source successfully. Now I want to know at what url this is running in my local. Thanks in advance
Abhishek Nalin
@abhisheknalin
Hi, I wanted to understand Zipkin better. I went through few pages of documentation at zipkin.io. There are a few Readme.md files in the source code. Is there any other source of documentation for Zipkin?
Jorg Heymans
@jorgheymans
@rajatsingh998 it would normally be running at http://localhost:9411/zipkin , any reason you're building from source ?
the quickstart is a better way to get up and running: https://github.com/openzipkin/zipkin/#quick-start
@abhisheknalin best is to start from the github root https://github.com/openzipkin/zipkin and start browsing from there. If you're interested how the server part works and is configured : https://github.com/openzipkin/zipkin/tree/master/zipkin-server , if you're interested in instrumentation more then https://github.com/openzipkin/brave/tree/master/brave and more in detail here https://github.com/openzipkin/brave/tree/master/brave .
the documentation is contained in each README.md of the respective modules
so by browsing https://github.com/openzipkin/ and its subprojects you will find what you need
rajat singh
@rajatsingh998
@jorgheymans thanks. Naah not any specific reason, just trying that way
Abhishek Nalin
@abhisheknalin
@jorgheymans Thanks a lot.
Jorg Heymans
@jorgheymans
@rajatsingh998 @abhisheknalin no worries
kristen
@kristen42170622_twitter
Hi, I have a question regarding http.status_code, according to the documentation, http.status_code should only have values if the API request is "not success", but in our zipkin data, there are many cases of http.status_code being 200, have I misunderstood the docs?
pnilesh1
@pnilesh1
Hi i am using tracing with envoy proxy. Is there any way we can disable tracing in envoy? I know there is x-b3-sample is there but my client wont be able to send this header.
José Carlos Chávez
@jcchavezs
@kristen42170622_twitter could you please point out that documentation? http.status_code is always there on a HTTP call.
@pnilesh1 better to check in envoy docs.