you can see it by cmd-click a few times in intellij
yeah, didn't dig that deep
my 2nd run with aggregateByKey(Aggregator.sortedReverseTake) is actually slower, i don't think there's significant diff in the actual algo between the 2
Hi, I'm new to Scio and dataflow. What is the recommended way to run Scio pipelines in production? Do you guys recommend using sbt "runMain ...." approach? or using the bash script generated by sbt-pack plugin? target/pack/bin/word-count --project=...
Hey ya'll, I'm trying to run tests with dataflow with PubSubIO and the pUb/Sub emulator. Any chance scio does an integration test like this? Having a really hard time figuring out why pubSubIO cannot connect to my emulator
Hello! I have been searching for a single example of how com.spotify.scio.values.SCollection readFilesAsString is used. Any example would be greatly appreciated
I have a quick question about schema evolution let’s say day 1 in my gcs file I have 5 columns and I populated to big query by creating five column table and then on day 5 my files had 7 columns do I need to manually add extra 2 columns in my big query or is there a way in Scala scio to add extra two columns in target and finish the process ?