
A service like this needs both.

Writing processes in a programming language AND a way to reuse them for different things.

IFTTT is a nice idea, but I can't do stuff like:

1. Check an RSS feed for new items.
2. Generate links to the real data from the links in the feed. (Most RSS feeds just suck, because they don't have the full articles, and sometimes they link to crazy pages where the article is split into 10 sub-pages.)
3. Fetch all the pages you want the data from.
4. Parse the relevant parts out of the pages.
5. Save the result somewhere a third party can't delete it again.
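To make the five steps concrete, here is a rough sketch of such a pipeline in Python, using only the standard library. Everything in it is an assumption for illustration: the function names, the `?page=N` scheme for split articles, and the idea that the article text lives in `<p>` tags. A real source would need its own rules for steps 2 and 4.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser
from pathlib import Path
from urllib.request import urlopen


def feed_links(rss_xml):
    """Step 1: return the <link> of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    return [item.findtext("link", "").strip() for item in root.iter("item")]


def real_urls(link, pages=1):
    """Step 2: map a feed link to the pages that hold the full article.

    The '?page=N' scheme is a made-up example; every site needs its own rule.
    """
    return [f"{link}?page={n}" for n in range(1, pages + 1)]


def http_fetch(url):
    """Step 3: download one page (default fetcher, swappable for testing)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


class ParagraphText(HTMLParser):
    """Step 4: collect the text found inside <p> elements."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    parser = ParagraphText()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def archive(rss_xml, out_dir, fetch=http_fetch, pages=1):
    """Run steps 1-5 and return the list of files written."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for i, link in enumerate(feed_links(rss_xml)):
        text = " ".join(extract_text(fetch(u)) for u in real_urls(link, pages))
        path = out / f"article_{i}.txt"
        path.write_text(text, encoding="utf-8")  # Step 5: keep a local copy
        written.append(path)
    return written
```

The `fetch` argument is passed in so the same pipeline can be pointed at a cache or a test stub instead of the network.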

Most of the time the real stuff isn't easily crawlable without writing a piece of software just for that source.

But you're right, there should be a way to get such apps to work without any code.

If I wrote a crawler for images from a DA artist, I wouldn't want to rewrite it for every other one...
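One way to get that kind of reuse, sketched in Python: keep the source-specific parts (URL layout, image pattern) as data and pass the artist in as a parameter. The class name, the `example.com` URL template, and the regex are all hypothetical; this is not how any particular site is laid out.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class GalleryCrawler:
    url_template: str   # hypothetical layout, must contain "{artist}"
    image_pattern: str  # regex with one group capturing the image URL

    def gallery_url(self, artist):
        """Build the gallery URL for any artist from the shared template."""
        return self.url_template.format(artist=artist)

    def image_urls(self, html):
        """Return every image URL the pattern finds in a fetched page."""
        return re.findall(self.image_pattern, html)


# One crawler definition, reused for every artist on the same site.
crawler = GalleryCrawler(
    url_template="https://example.com/{artist}/gallery",
    image_pattern=r'<img[^>]+src="([^"]+)"',
)
```

Calling `crawler.gallery_url("alice")` and `crawler.gallery_url("bob")` then reuses the same definition, and only a new site needs a new `GalleryCrawler`.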



  Most of the time the real stuff isn't easily crawlable without writing a piece of software just for that source.
  But you're right, there should be a way to get such apps to work without any code.
  If I wrote a crawler for images from a DA artist, I wouldn't want to rewrite it for every other one...
Shameless off-topic plug: that's exactly what we at http://import.io are trying to solve. We're in Developer Preview, check us out!


Do I get this right: this is not a web service but an application that runs on my PC?


The application is for building extractors and connectors, which then get published to our platform so you can query them using our web service.


Looks very interesting. I like that it's a desktop app!



