
"Language Independent Content Extraction From Web Pages"

Author: R. Chandramma


Available to order

36.99 €

regular price: 41.10 €

About the book

The rapid development of the internet and web publishing techniques has created numerous information sources published as HTML pages on the World Wide Web. However, web pages also carry a lot of redundant and irrelevant information. Navigation panels, tables of contents (TOC), advertisements, copyright statements, service catalogs, privacy policies, etc. on web pages are considered redundant and irrelevant content. Such information complicates various web mining tasks such as web page crawling, web page classification, link-based ranking, and topic distillation.

  • Publisher: LAP LAMBERT Academic Publishing
  • Year of publication: 2019
  • Format: Paperback
  • Dimensions: 220 x 150 mm
  • Language: English
  • ISBN: 9786137328484
