java - text extraction from web pages -


i'm working on program reads content of page: http://www.pogdesign.co.uk/cat/ , alerts me when 1 of favourite tv series scheduled , on day of month. in program have jframe display months (maybe jtabbedpane) , in each month want list episodes of favourite tv series relative day of month.

i have wrote using "jsoup: java html parser" in order extract text html web page. need understand approach use in order implements these steps:

  1. find day of month when episodes of specific tv series scheduled , save them somewhere.
  2. get a ref links of episodes , use them find broadcast time of each episode , save them somewhere.

so think strategy that? java html parser enough complete program this?

first suggest list of item have title want find, , after use parent() method (jsoup) know relative day in 1 step.

i think can easy jsoup.


Comments