Good morning friends, I have a small project in mind as part of a bigger one and I need to pull some data automatically in order to automate some functions and save some data entry.
I found a very nice Php library called
Goutte in which just all the other libraries the filtering for finding the data in certain nodes would rely on CSS selectors and the attributes of the elements such as ID and classes.
The problem is that one of the pages that I am trying to get the content from has a really old structure, they don't use CSS, nor IDs nor classes for their elements. Everything is styled via deprecated attributes on each element.
So basically I couldn't find a standard and easy way to reach the data I want and most probably.
Anyone has ideas or suggestions for such a case? Please feel free to share your thoughts. Thanks! :)
P.S. This is an example of a page that I mean
http://www.ticketingboxoffice.com/ check its source, in this one they wrapped the description of an event within a div with an id : 'lay1' yet the ID is repeated across the document :P