Ed Summers
@edsu
the first time you run it, it would get as many tweets as it can and write them to /mnt/tweets/ferguson/tweets-0001.json
Renato Gabriele
@remagio
mismatch between our discussion and source ;-)
Ed Summers
@edsu
the next time you run it, it will look at the first tweet in tweets-0001.json and use that twitter id as the minimum id to archive, and write the tweets as /mnt/tweets/ferguson/tweets-0002.json
that's what i was thinking anyway
i didn't like the way the old twarc named files
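A minimal sketch of the naming scheme described above, not the actual search_archive.py. It assumes each archive file holds one JSON tweet per line with the newest tweet first, and that the tweet id is stored under an "id" key. The function name next_archive and the regular expression are hypothetical.

```python
import json
import re
from pathlib import Path

# Hypothetical pattern for archive files such as tweets-0001.json.
ARCHIVE_RE = re.compile(r"^tweets-(\d+)\.json$")


def next_archive(archive_dir):
    """Return (since_id, next_path) for the next archiving run.

    since_id is None on the first run, when no archive file exists yet.
    """
    archive_dir = Path(archive_dir)
    numbered = []
    for path in archive_dir.iterdir():
        match = ARCHIVE_RE.match(path.name)
        if match:
            numbered.append((int(match.group(1)), path))

    if not numbered:
        # First run: fetch as much as possible and start at 0001.
        return None, archive_dir / "tweets-0001.json"

    # Pick the most recent archive by its sequence number.
    last_num, last_path = max(numbered, key=lambda item: item[0])

    # Assumption: the first line of the file is the newest tweet.
    with last_path.open(encoding="utf-8") as f:
        first_line = f.readline()
    since_id = json.loads(first_line)["id"] if first_line.strip() else None

    return since_id, archive_dir / f"tweets-{last_num + 1:04d}.json"
```

Calling next_archive("/mnt/tweets/ferguson") on an empty directory would give no minimum id and the path ending in tweets-0001.json; on later runs it would give the id of the first tweet in the latest file and the next numbered path.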
Renato Gabriele
@remagio
right
I think it works and ppl will like this
Ed Summers
@edsu
ok let me see if i can get it working
maybe you can try it out :-)
Renato Gabriele
@remagio
:-)
Anyway, THX!
Your contributions to this kind of stuff are great
Have a good day @edsu
Ed Summers
@edsu
hopefully i'll have something for you to try in an hour
thanks for working w/ me on it!
Renato Gabriele
@remagio
You are welcome
We could discuss NoSQL integration soon if you want
Ed Summers
@edsu
super to have someone in italy trying it out btw -- makes a big difference to have it not just be a US thing
sure
Renato Gabriele
@remagio
I see, but we have ridiculous "numbers" compared with the US
I mean, the media say "Incredible, Italian Twitter scene crazy for next Presidente"
But it's fewer than 200k tweets, mostly bots or SMM with botnets.
Ed Summers
@edsu
:-) yeah, but i think it's interesting how social media is being used in different cultures/contexts
Renato Gabriele
@remagio
Sure, here the hype is very high only because we have some journalists who are at the same time journalists, SMM and political consultants, and are leaders in TV and papers. So all the media say every day: people are crazy on Twitter for blah blah blah
And usually there are only 30k-50k tweets
But it gets prime-time news on TV.
Ed Summers
@edsu
i see yeah
Renato Gabriele
@remagio
Here comes twarc
;-)
plus other tools.
I have lived in a few countries, but I think Italy, for example, is still in the early phase of social media analysis.
I mean it's still used to support points made outside Twitter.
Ed Summers
@edsu
i think we all are still figuring out what social media means ; particularly in the political arena
Renato Gabriele
@remagio
yes, here I think there will be lots to learn b/c the entire SMM community is involved in politics.
But something happens here that is not possible in the US.
The same people create opinions using CnC plus consultants plus volunteers.
Ed Summers
@edsu
quick question: would it be ok if this search_archive.py wrote the data as gzip files?
or would it be easier if they are uncompressed to start with?
Renato Gabriele
@remagio
uncompressed better
Ed Summers
@edsu
ok
Renato Gabriele
@remagio
so it can be checked using a pipe
Ed Summers
@edsu
zcat is nice on linux
zcat file.gz | grep foo
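For scripts that cannot shell out to zcat, roughly the same check can be done with Python's standard gzip module. This is only a sketch; the function name grep_gzip is hypothetical, and it assumes the archive is UTF-8 text with one record per line.

```python
import gzip


def grep_gzip(path, needle):
    """Yield lines of a gzip-compressed text file that contain needle."""
    # "rt" opens the compressed file in text mode, decoding as it reads.
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            if needle in line:
                yield line.rstrip("\n")
```

For example, list(grep_gzip("file.gz", "foo")) collects the matching lines, similar in spirit to the zcat pipeline above but with plain substring matching instead of grep patterns.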
Renato Gabriele
@remagio
not yet used in bash scripts
Ed Summers
@edsu
but i'll leave it uncompressed for now
Renato Gabriele
@remagio
not yet
Ed Summers
@edsu
ok
Renato Gabriele
@remagio
but it could be when you think it's fine