gist.githubusercontent.com_simonw_6b76f8780bcbb8f6ad7b8c9f0dce5392_raw_617f45992427370d36be20ffd346bb84c07656b8_blog-paragraphs: 68
rowid | new_id | clean_paragraph |
---|---|---|
68 | 36-0 | passage: I've been looking at PHP's XML handling functions (in particular the xml_parse() function) and I've suddenly realised the advantages of writing entries in valid XHTML. Before I started this blog, one of the features I considered adding was something that could pull all of the links out of an entry when it is submitted and index them or add them to a directory somewhere. I was preparing myself for some regular expression hacking, but thanks to XHTML this is now completely unnecessary. All I need to do is define a couple of handlers to deal with <a> tags, and Expat will do the hard work for me. In fact, this approach gives me a great deal of flexibility in what I do with my entries. I can extract quotes and blockquotes, pick up on emphasized text and generally allow my blog software to "understand" my entries as and when I add them. The true benefits of XHTML have suddenly become clear. |
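
The handler idea in the passage can be sketched roughly as follows. This is not the author's PHP code: it is a minimal illustration in Python, whose `xml.parsers.expat` module also wraps Expat and uses the same start/end/character-data callback style. The function name `extract_links`, the wrapping `<entry>` root element, and the sample entry are all assumptions made for the example, and it assumes the entry is well-formed and uses no named HTML entities such as `&nbsp;`.

```python
import xml.parsers.expat
from typing import List, Tuple


def extract_links(xhtml: str) -> List[Tuple[str, str]]:
    """Return (href, link text) pairs for each <a href="..."> in well-formed XHTML."""
    links = []
    current = None  # [href, text_parts] while inside an <a>, otherwise None

    def start_element(name, attrs):
        nonlocal current
        if name == "a" and "href" in attrs:
            current = [attrs["href"], []]

    def char_data(data):
        # Expat may deliver text in several chunks, so collect and join later.
        if current is not None:
            current[1].append(data)

    def end_element(name):
        nonlocal current
        if name == "a" and current is not None:
            links.append((current[0], "".join(current[1])))
            current = None

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start_element
    parser.CharacterDataHandler = char_data
    parser.EndElementHandler = end_element
    # Hypothetical wrapper: an entry is a fragment, so give it a single root element.
    parser.Parse("<entry>" + xhtml + "</entry>", True)
    return links


entry = (
    '<p>See <a href="http://example.com/">this <em>page</em></a> '
    'and <a href="/about">about</a>.</p>'
)
print(extract_links(entry))
# [('http://example.com/', 'this page'), ('/about', 'about')]
```

The same three handlers could just as easily watch for `blockquote` or `em` elements, which is the flexibility the passage describes. If the entry is not well-formed, the parser raises an error instead of guessing, which is the trade-off for skipping the regular expressions.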