Shaare your links...
285 links
marks Home Login RSS Feed ATOM Feed Tag cloud Picture wall Daily
Links per page: 20 50 100
page 1 / 1
1 results for tags goose x
  • Goose - Article Extractor
    open source Instapaper-like parser...
    Goose was originally an article extractor written in Java that has most recently (aug2011) converted to a scala project. It's mission is to take any news article or article type web page and not only extract what is the main body of the article but also all meta data and most probable image candidate.

    The extraction goal is to try and get the purest extraction from the beginning of the article for servicing flipboard/pulse type applications that need to show the first snippet of a web article along with an image.

    Goose will try to extract the following information:

       Main text of an article
       Main image of article
       Any Youtube/Vimeo movies embedded in article
       Meta Description
       Meta tags
       Publish Date
    Fri 08 Mar 2013 02:24:26 PM CET - permalink -
    - https://github.com/jiminoc/goose
    article extractor goose opensource os parser webDev
Links per page: 20 50 100
page 1 / 1
Shaarli 0.0.41 beta - The personal, minimalist, super-fast, no-database delicious clone. By sebsauvage.net. Theme by idleman.fr.