    EmilioMari
    @EmilioMari
    Hi
    I would like to ask some questions about datadr. Is this the right place to do it? Thanks.
    Or is there a forum about datadr?
    hafen
    @hafen
    Hi Emilio - this is the place to ask. It's new and I haven't announced it yet, so you're the first one on!
    EmilioMari
    @EmilioMari
    Hi, thanks. I am away on a family visit until November 30th. On my return I will ask you a very basic connection question. Regards.
    Sorry for the typos: my phone autocorrects to the wrong words. I will get in touch from my laptop.
    EmilioMari
    @EmilioMari
    Anyway, the first question is a very basic one. I want to connect to my 15 GB file on the hard disk and run a simple dim(pus.csv).
    hafen
    @hafen
    Do you have some example code? dim isn't currently implemented for ddf objects but nrow and ncol are.
    EmilioMari
    @EmilioMari
    Well, I want to know the total rows and total columns of any unknown CSV file in a folder,
    and I can't find a way to do it with datadr.
    EmilioMari
    @EmilioMari
    A real code example could be
    cup98 <- read.csv("c:/data/cup98lrn.txt")
    dim(cup98)
    [1] 95412   481
    But with datadr
    Csv
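As a cross-check outside datadr, the row and column counts of a large CSV can be obtained from the shell in a single streaming pass, without loading the file into memory. This is a minimal sketch and not datadr's own method. It assumes a plain comma-separated file with one header line and no quoted fields containing commas or newlines. The function name `csv_dims` is hypothetical.

```shell
# csv_dims FILE
# Prints "<data rows> <columns>" for a comma-separated file.
# Assumptions: the first line is a header, and no field contains
# a quoted comma or an embedded newline (naive comma splitting).
csv_dims() {
  awk -F',' '
    NR == 1 { cols = NF }                  # column count taken from the header line
    END {
      rows = (NR > 0) ? NR - 1 : 0         # do not count the header as a data row
      printf "%d %d\n", rows, cols
    }
  ' "$1"
}
```

To report every CSV in a folder, something like `for f in /path/to/folder/*.csv; do printf '%s: ' "$f"; csv_dims "$f"; done` should work, where the folder path is a placeholder.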
    gc92
    @myeggo
    Hi, when I follow the instructions at http://deltarho.org/docs-install-cluster/ to install Protocol Buffers, I get an error running ./autogen.sh. The error message is below. Please help.
    [root@clouderagateway protobuf-2.5.0]# ./autogen.sh
    Google Test not present. Fetching gtest-1.5.0 from the web...
    % Total % Received % Xferd Average Speed Time Time Time Current
    Dload Upload Total Spent Left Speed
    105 1586 105 1586 0 0 19717 0 --:--:-- --:--:-- --:--:-- 154k
    bzip2: (stdin) is not a bzip2 file.
    tar: Child returned status 2
    tar: Error is not recoverable: exiting now
    johncleveland
    @johncleveland
    I am trying to install the full stack on multiple nodes. However, there is a problem with Protocol Buffers: it seems version 2.5.0 is not available. How do we navigate this situation?
    In fact, I have the same problem as the above user.
    hafen
    @hafen
    You can install protocol buffers 2.5.0 from source by downloading the following: https://github.com/google/protobuf/releases/download/v2.5.0/protobuf-2.5.0.tar.gz
    Looking at http://deltarho.org/docs-install-cluster/#installation, it appears that the link to protocol buffers 2.5.0 is still alive. So following those instructions should work. Please let me know if it doesn't.
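The log above suggests that ./autogen.sh fails while fetching gtest-1.5.0: the download is only 1586 bytes and is not a bzip2 archive. One possible workaround, sketched below, is to build from the release tarball linked above. This assumes the release tarball ships a pre-generated configure script (so ./autogen.sh can be skipped), that a C++ toolchain and make are already installed, and that the commands run as root as in the log. `install_protobuf` is a hypothetical helper name, not part of the DeltaRho instructions.

```shell
PROTOBUF_VERSION=2.5.0
PROTOBUF_URL="https://github.com/google/protobuf/releases/download/v${PROTOBUF_VERSION}/protobuf-${PROTOBUF_VERSION}.tar.gz"

# Hypothetical helper: download, build, and install protobuf from the release tarball.
# The body runs in a subshell so the caller's working directory is unchanged.
install_protobuf() (
  wget "$PROTOBUF_URL" &&
  tar -xzf "protobuf-${PROTOBUF_VERSION}.tar.gz" &&
  cd "protobuf-${PROTOBUF_VERSION}" &&
  ./configure &&      # assumption: configure is included in the release tarball
  make &&
  make install &&     # needs root; prefix with sudo if not running as root
  ldconfig            # refresh the shared-library cache (Linux)
)
```

Running `install_protobuf` and then `protoc --version` should confirm which version ended up on the PATH.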
    johncleveland
    @johncleveland
    Mr. Hafen, thanks for your prompt response. So I can use wget on the first link instead of the one in the instructions? The instructions work fine until one gets to ./autogen.sh. I will try this link and see what happens, thanks.
    johncleveland
    @johncleveland
    The multi-node Ubuntu cluster wget instruction for Rhipe yields ERROR 404: Not Found, and the URI http://www.tessa.io times out (can't find page).
    johncleveland
    @johncleveland
    In the datadr and trelliscope section, when following the instructions, one gets ... Error in library(devtools): there is no package called 'devtools'
    johncleveland
    @johncleveland
    Also, for the protobuf section, there is no target or makefile.
    Edmund Walsh
    @ewalsh
    John - take a look at this: datacratic/protobuf#2. I think that may be the issue.
    hafen
    @hafen
    @johncleveland Ah, sorry, I suppose it assumes devtools is already installed. I'll open an issue about that.
    Actually never mind - I just double checked and it does say to install devtools first and provides the command.
    Edmund Walsh
    @ewalsh
    Hello all, can someone please send me a few tips or a pointer to some documentation that can help me split my job into smaller sections? It isn't a lot of data, but there are a lot of iterations, and I am trying to split them among many vcores. I am hoping to just pass a variable in during the rhwatch call, as not all jobs are the same. The option mapred.map.tasks seemed like it would do it, but it doesn't seem to work for me.
    Many thanks in advance
    bharathi-srini
    @bharathi-srini
    Hi,
    I am trying to run the DeltaRho packages on a Hadoop cluster, but I cannot find the Rhipe package. The installation site points to http://ml.stat.purdue.edu/rhipebin/Rhipe_0.74.0.tar.gz and this URL isn't found.
    http://www.stat.purdue.edu/~sguha/rhipe/dn/Rhipe_0.64.tar.gz downloads the package, but loading it in RStudio throws an error reporting that there is no such package.
    Any help would be appreciated