FAQ
If you need to extract main content and image from a web page, this library
helps you:
https://github.com/advancedlogic/GoOse

go get github.com/advancedlogic/GoOse

package main
import (
     "github.com/advancedlogic/GoOse")
func main() {
     g := goose.New()
     article := g.ExtractFromUrl("http://edition.cnn.com/2012/07/08/opinion/banzi-ted-open-source/index.html")
     println("title", article.Title)
     println("description", article.MetaDescription)
     println("keywords", article.MetaKeywords)
     println("content", article.CleanedText)
     println("url", article.FinalUrl)
     println("top image", article.TopImage)}

--
You received this message because you are subscribed to the Google Groups "golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupgolang-nuts @
categoriesgo
postedSep 9, '14 at 2:31p
activeSep 9, '14 at 2:31p
posts1
users1
websitegolang.org

1 user in discussion

Antonio Linari: 1 post

People

Translate

site design / logo © 2022 Grokbase