Scripting: Need help with an HTML parser!

uriel · March 31, 2008, 3:53am

Hi, I’m looking for a solution that does the job; I got lost in the many answers on this topic!

I want to show a local HTML document in the script window. I found out how to call an external document, but I need to parse its elements: text, images, and rows and columns (or position and width).

So I decided to ask for help. Maybe a generous script writer can explain how it works and where to find the essential structure for a correct parser!

I’m used to working on UI design, but this time it’s a challenge for me.

Bye, I hope to find someone here!
uriel

HouseArrest · March 31, 2008, 4:53am

Not for nothing, but this is why I suggest learning the language before you try creating scripts that “just barely do the job”. Read this link. You should then be able to apply what you know about the internal workings of Blender (sorry, I can’t help you there, as I haven’t seen the source) to get what you want.

IanC · March 31, 2008, 5:16am

http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/

You may just want to look at including an external renderer. I’m sure there’s a web-browser example with python somewhere.

forTe · March 31, 2008, 5:23am

HTML is a little overkill if all you want to do is show positioned text and images. It’d be much easier if you created your own format and parsed that. Something like:


T(20, 30):Hi Blender!
I(40, 60): /Users/Me/MyImage.jpg
T(100, 20): This is my image!

Where T would be text that you could output to the window, and I would store the location of an Image you want to display. The part in parentheses would store its location. Then all you have to do is read in the created file, scan through the lines, and take the appropriate parts of each string and apply the action you want. You could add more things like text color easily (even things like padding and background color wouldn’t be horribly hard).
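For what it’s worth, the line-scanning step could look roughly like the sketch below. It assumes plain Python with nothing Blender-specific, written for a current Python 3 rather than the interpreter bundled with Blender at the time. The names `Item`, `LINE_RE`, and `parse_layout` are made up for illustration, and the actual drawing to the script window is left out.

```python
import re
from typing import List, NamedTuple


class Item(NamedTuple):
    """One parsed entry of the hypothetical layout format."""
    kind: str   # "T" for text, "I" for image
    x: int
    y: int
    value: str  # the text to draw, or the image path


# One entry per line: a kind letter, "(x, y)", a colon, then the payload.
LINE_RE = re.compile(r"^\s*([TI])\(\s*(-?\d+)\s*,\s*(-?\d+)\s*\)\s*:\s*(.*?)\s*$")


def parse_layout(source: str) -> List[Item]:
    """Turn the text of a layout file into a list of Item tuples."""
    items = []
    for number, line in enumerate(source.splitlines(), start=1):
        if not line.strip():
            continue  # skip blank lines
        match = LINE_RE.match(line)
        if match is None:
            raise ValueError("line %d is not a valid entry: %r" % (number, line))
        kind, x, y, value = match.groups()
        items.append(Item(kind, int(x), int(y), value))
    return items


SAMPLE = """\
T(20, 30):Hi Blender!
I(40, 60): /Users/Me/MyImage.jpg
T(100, 20): This is my image!
"""

if __name__ == "__main__":
    for item in parse_layout(SAMPLE):
        print(item)
```

Each `Item` then carries everything a draw callback would need: what kind of thing it is, where it goes, and what to show. Adding a color or padding field would mean extending the pattern and the tuple.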

As HouseArrest’s link shows you (if you go to the next page), Python has built in parser capabilities (it has an html parser as well).
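As a rough illustration of that built-in parser, here is a minimal sketch using `html.parser.HTMLParser` from the standard library of current Python 3 (in the Python 2 of this era the module was named `HTMLParser`). The class `TextAndImageCollector` and the helper `collect` are hypothetical names. The sketch only pulls out text runs and image sources in document order; it does no layout at all.

```python
from html.parser import HTMLParser


class TextAndImageCollector(HTMLParser):
    """Collect ("text", ...) and ("img", ...) pairs in document order."""

    def __init__(self):
        super().__init__()
        self.items = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.items.append(("img", src))

    def handle_data(self, data):
        # Ignore whitespace-only runs between tags.
        text = data.strip()
        if text:
            self.items.append(("text", text))


def collect(html_source):
    """Parse an HTML string and return the collected items."""
    parser = TextAndImageCollector()
    parser.feed(html_source)
    parser.close()
    return parser.items


if __name__ == "__main__":
    page = '<p>Hi Blender!</p><img src="MyImage.jpg"><p>This is my image!</p>'
    for item in collect(page):
        print(item)
```

Note that this gives you the content but none of the positions, which is exactly the part that takes the real work.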

To really be able to use what you get out of any HTML parser, though, you’re going to have to implement a DOM, write lots of rules, define lots of behaviors, and be quite comfortable with Python and the API. This is not an easy task at all, and quite frankly, Blender does not easily have the capabilities to do everything you need visually (like displaying fonts other than the built-in bitmap one in its script window, which means you’re stuck with a couple of sizes of text). I suppose in theory it does have the capabilities, though, if you really know what you’re doing and you can put in the work to figure stuff out.

For right now, unless you’re really in bed with OpenGL, Python, and the API, I’d strongly suggest making your own file format like the one above. It’s going to save you so many headaches over trying to implement an HTML viewer that follows all the little rules of rendering HTML.

Edit: An external renderer could work, but it depends on whether it’s drawing to hardware or software, whether you can get access to the buffers it’s drawing to, and whether you can then translate them to be drawable with the API or somehow blit them onto the script window.

Nyrath · March 31, 2008, 9:52am

http://www.crummy.com/software/BeautifulSoup/

BeBraw · March 31, 2008, 10:41am

One interesting way to handle it would be to use YAML. See http://www.yaml.org/ and http://pyyaml.org/ for reference. It is extremely simple to read and use. I warmly recommend it. You can find some examples of usage at http://pyyaml.org/wiki/PyYAMLDocumentation .

uriel · April 2, 2008, 3:38am

Hi,
thanks for your answers,
forTe: I’d like to go that way, making my own file format to parse HTML code or BBCode… XML maybe?

I’ll stay tuned on this… it’s hard work to do… but nothing is impossible!!!