Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Activity
    Mek
    @mekarpeles
    I suggest catching up with Drew Winget whose doing work w/ annotations!
    Gerben
    @Treora
    is Drew here?
    Mek
    @mekarpeles
    I think he's just finishing the tour (eta 10m)
    I'm contacting Drew now -- he'll jump on gitter
    Ben Welsh
    @palewire_twitter
    @jjjake I'm noticing that after an item with a couple images attached as files is uploaded the URLs to the files on archive.org are not always immediately available. Am I screwing something up or is that expected behavior?
    @palewire_twitter can you send me an identifier as an example? The files should be available pretty shortly after they've been uploaded, but maybe not immediately.
    Ben Welsh
    @palewire_twitter
    pastpages-reuters-46259-3539759
    They seem to show up pretty quickly. So they could be there by the time you look.
    After I upload an item I'm syncing the newly minted IA id in my local database. I'd also like to sync the URL of items alongside it. But if I retrieve the item via the API immediately after upload I find it's often not there yet.
    I could write a task to "go behind" a bit later and fill in the blanks, but I'd rather not.
    Would it be safe to "assume" the URLs will be there with a reliable namespace?
    aeschylus
    @aeschylus
    @Treora
    Gerben
    @Treora
    yes joining now
    ehm well, it looks closed?
    I get a login screen
    aeschylus
    @aeschylus
    Eh. Weird. I'm private messaging you. Let's try some other service
    Benjamin De Kosnik
    @bdekoz
    Was
    jjjake
    @jjjake
    @palewire_twitter Yes, it is safe to predict the URL for files: https://archive.org/download/<identifier>/<filename>
    or https://archive.org/download/<identifier>/<path>/<to>/<my>/<file>
    Does that help?
    Ben Welsh
    @palewire_twitter
    sure does. I was reluctant to bank on that. But if you say so, I'l just skip saving the URLs into my database.
    Mark Graham
    @MarkGraham_twitter
    Here is some code (in development) to show differences in the content of two URLs (in the case of the code right now WARCs)
    https://github.com/harvard-lil/warc-diff-tools
    Omar Rizwan
    @osnr
    @MarkGraham_twitter cool!
    Mark Graham
    @MarkGraham_twitter
    :-)
    Linda
    @lpatmo
    Hello!
    Jono Brandel
    @jonobr1
    hi Linda
    Mark Graham
    @MarkGraham_twitter
    Here is a URL for a ScreenShot of a Wayback Machine Playback page: https://web.archive.org/web/20160904103421id_/http://web.archive.org/screenshot/http://iskme.org/
    Jono Brandel
    @jonobr1
    WOAH
    so cool
    Dr. Neal Krawetz
    @hackerfactor_twitter
    Very cool!
    Ben Welsh
    @palewire_twitter
    Wow. Is that the new screenshot endpoint, @MarkGraham_twitter ?
    Richard Caceres
    @rchrd2_twitter
    It looks like it finds the nearest screen shot based on the timestamp. There are a lot less screenshots than there are captures. So we still might want a different service that generates a screenshot from the wayback machine.
    Mek
    @mekarpeles
    Hey @Treora! Want to join the zoom again?
    I think we have our ducks in a row
    Step 1) Get the whole Manual for Civilization collection mirrored digitally
    Richard Caceres
    @rchrd2_twitter
    Hello. Wayback machine prototype in https://gitter.im/ArchiveExperiments/Room-B
    Omar Rizwan
    @osnr
    useful docs while i'm messing around with wayback stuff:
    Richard Caceres
    @rchrd2_twitter
    Hi Thanks for the links
    Here's our repo. It's still a skeleton though: https://github.com/ArchiveLabs/wayback-coverflow
    Gerben
    @Treora
    aeschylus: you around again, working on anything?
    Mek
    @mekarpeles
    I haven't seen him
    Omar Rizwan
    @osnr
    have some basic image diff working :-) https://github.com/osnr/waydiff
    Mek
    @mekarpeles
    Dude!
    @osnr awesome
    Omar Rizwan
    @osnr
    one thing which would be useful btw would be to add CORS support to as many of the APIs as you can :-)
    tmad4000
    @tmad4000
    Database of People and Their Interests -- add yourselves! https://ideapad.io/internet-archive-hackathon/people
    Mek
    @mekarpeles
    Thank you everyone for making this a wonderful event and for bearing with our attempts at organization!
    A lot we can improve in the future, I hope people felt included and weren't too blocked!