These are chat archives for FreeCodeCamp/DataScience

Jan 23, 2017
Tom Lee
@user512
Jan 23 2017 16:19
Hi all, I'm new to web scraping and would like to learn about it. Can you recommend some tools or resources? Thank you.
Russell S. Pierce
@russellpierce
Jan 23 2017 16:21
Depends a bit on your price point and purpose. You can certainly roll your own, but there are also paid tools out there where you can outsource some of the complexity at the cost of losing fine grained control, e.g. https://www.diffbot.com/
Tom Lee
@user512
Jan 23 2017 16:32
Thanks, currently I'm simply looking at scraping a static page, so I'm looking at some open source libraries.
Russell S. Pierce
@russellpierce
Jan 23 2017 16:33
Ah, yeah, there are /tons/ of ways to skin that cat. I haven’t done webscraping lately, so I don’t have anywhere to point you in particular. Sorry. :(
Tom Lee
@user512
Jan 23 2017 16:37
No problem, I will keep checking this channel to see if there's any library I can look into. I will stick with a Ruby or JS HTML parsing library at this point.
Amelia
@apottr
Jan 23 2017 16:43
cheerio is a fantastic js html/xml parser
Tom Lee
@user512
Jan 23 2017 16:45
Nice, by just looking at it, the syntax looks familiar. Thanks @apottr
Alice Jiang
@becausealice2
Jan 23 2017 20:39
Cheerio was super easy to work with, as far as JavaScript goes
Anyone working through the data viz cert in beta?
Amelia
@apottr
Jan 23 2017 20:48
I haven't touched the beta yet, but it looks nice
Alice Jiang
@becausealice2
Jan 23 2017 20:52
Can you get this one to pass? The solution should be .style("font-family", "verdana")
I've refreshed a million times, reset my code as many times, checked all the GitHub issues, and read the test cases in FCC's JS code, but I can't get it to pass
and the only error in my console when I refresh is "Content Security Policy: Directive ‘frame-src’ has been deprecated. Please use directive ‘child-src’ instead." :/
Amelia
@apottr
Jan 23 2017 21:00
yes, I got it to work
exactly the way you said it should be
took a couple refreshes though
Alice Jiang
@becausealice2
Jan 23 2017 21:01
What browser are you using?
Amelia
@apottr
Jan 23 2017 21:02
Chrome 55
on Linux
Alice Jiang
@becausealice2
Jan 23 2017 21:12
Might be a firefox thing
Amelia
@apottr
Jan 23 2017 21:20
could be
yeah, looks like a firefox thing