module WebParser::Doc

Public Class Methods

included(base) click to toggle source
# File lib/web_parser/doc.rb, line 5
def self.included base
  base.extend ClassMethods
end
new(doc, parser=Nokogiri::HTML) click to toggle source

Creates a new page parser @param [String] doc Nokogiri object with page we would parsing

# File lib/web_parser/doc.rb, line 39
def initialize doc, parser=Nokogiri::HTML
  @doc = parser.parse(doc)
  raise XPathsNotSet, "no recipes defined!" unless self.class.recipes
end

Public Instance Methods

parse() click to toggle source

Main method for parsing document @return [Hash] Parsed informations from page in a hash

# File lib/web_parser/doc.rb, line 46
def parse
  get_parsed
end

Private Instance Methods

get_parsed() click to toggle source
# File lib/web_parser/doc.rb, line 52
def get_parsed
  self.class.recipes.apply(@doc)
end