I need to parse a large amount of text that uses HTML font tags for formatting,
For example:
<font face="fontname" ...>Some text</font>
Specifically, I need to determine which characters would be rendered using each font used in the text. I need to be able to handle stuff like font tags inside another font tag.
I need to use C# for this. Is there some sort of C# parser class to make this easier? Or would I have to write it myself?
Thanks!
You could load the HTML into Internet Explorer, and then query the DOM for font tags, (or CSS style).
Don’t know if this is the best option performance wise.