<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Convert HTML to Plain Text in C# using Markdown</title>
	<atom:link href="http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/</link>
	<description>Welcome to the miscellaneous mutterings of Simon Dingley, Certified Umbraco Developer</description>
	<lastBuildDate>Fri, 13 Jan 2012 21:47:17 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Simon Dingley</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-10418</link>
		<dc:creator>Simon Dingley</dc:creator>
		<pubDate>Mon, 17 Oct 2011 07:36:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-10418</guid>
		<description>Hi Kamil, not sure I understand what you mean. Could you possibly provide a little more detail on what you are trying to achieve?</description>
		<content:encoded><![CDATA[<p>Hi Kamil, not sure I understand what you mean. Could you possibly provide a little more detail on what you are trying to achieve?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kamil</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-10235</link>
		<dc:creator>Kamil</dc:creator>
		<pubDate>Tue, 11 Oct 2011 13:29:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-10235</guid>
		<description>Hi, I&#039;m trying to alter how this xslt document works. I&#039;m trying to insert some kind of mark like &#039;[]&#039; or anything so that I could parse each text as individual part.

No matter where I&#039;m trying to insert  I can&#039;t modify how it works :(

Any help?</description>
		<content:encoded><![CDATA[<p>Hi, I&#8217;m trying to alter how this xslt document works. I&#8217;m trying to insert some kind of mark like &#8216;[]&#8216; or anything so that I could parse each text as individual part.</p>
<p>No matter where I&#8217;m trying to insert  I can&#8217;t modify how it works <img src='http://www.prolificnotion.co.uk/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </p>
<p>Any help?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon Dingley</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-5677</link>
		<dc:creator>Simon Dingley</dc:creator>
		<pubDate>Sun, 08 May 2011 20:23:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-5677</guid>
		<description>Hi Lee, glad it is proving of use let me know if you come up with any improvements, which I&#039;m sure you will.</description>
		<content:encoded><![CDATA[<p>Hi Lee, glad it is proving of use let me know if you come up with any improvements, which I&#8217;m sure you will.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lee Kelleher</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-5675</link>
		<dc:creator>Lee Kelleher</dc:creator>
		<pubDate>Sun, 08 May 2011 15:25:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-5675</guid>
		<description>... and 18 months later, I&#039;m search around for a HTML to Markdown utility and I find your blog post! Thank you!

Good find with the Symphony XSLT Helpers - Symphony is a great CMS with a community as dedicated as the Umbraco one!

I&#039;m planning on adding this conversion method to my Markdown Editor package as an XSLT extension, (in case anyone wants to convert back from HTML).

Cheers, Lee.</description>
		<content:encoded><![CDATA[<p>&#8230; and 18 months later, I&#8217;m search around for a HTML to Markdown utility and I find your blog post! Thank you!</p>
<p>Good find with the Symphony XSLT Helpers &#8211; Symphony is a great CMS with a community as dedicated as the Umbraco one!</p>
<p>I&#8217;m planning on adding this conversion method to my Markdown Editor package as an XSLT extension, (in case anyone wants to convert back from HTML).</p>
<p>Cheers, Lee.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon Dingley</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-3034</link>
		<dc:creator>Simon Dingley</dc:creator>
		<pubDate>Wed, 08 Sep 2010 19:07:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-3034</guid>
		<description>With regards to the undeclared entity exception - do you have a doctype defined in the HTML you are loading?

To address the issues with the tags you mention you will need to look in the markdown.xml file, in particular the templates defined under the following comments:
[code language=&quot;xml&quot;]
 &lt;!-- tables --&gt;
 &lt;!-- unordered lists --&gt;
 &lt;!-- ordered lists --&gt;
[/code]

I think those elements you mention have templates setup to seperate elements using whitespace [code language=&quot;xml&quot;]&#xA;[/code]. You should be able to adapt it to suit your requirements.

I hope this is of some help?</description>
		<content:encoded><![CDATA[<p>With regards to the undeclared entity exception &#8211; do you have a doctype defined in the HTML you are loading?</p>
<p>To address the issues with the tags you mention you will need to look in the markdown.xml file, in particular the templates defined under the following comments:</p>
<pre class="brush: xml; title: ; notranslate">
 &lt;!-- tables --&gt;
 &lt;!-- unordered lists --&gt;
 &lt;!-- ordered lists --&gt;
</pre>
<p>I think those elements you mention have templates setup to seperate elements using whitespace
<pre class="brush: xml; title: ; notranslate">&amp;#xA;</pre>
<p>. You should be able to adapt it to suit your requirements.</p>
<p>I hope this is of some help?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ludwigs3rd</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-3033</link>
		<dc:creator>ludwigs3rd</dc:creator>
		<pubDate>Wed, 08 Sep 2010 18:02:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-3033</guid>
		<description>I had this concept for a project but figured I&#039;d search before reinventing the wheel. Good work, elegant solution!  :)
A couple of things...
I tried your solution but I&#039;m getting an exception:
Reference to undeclared entity &#039;nbsp&#039;.
at line:
xmlDoc.LoadXml(HTML);

Also, any UL, OL, or TABLE don&#039;t get any formatting. It&#039;s all one line of text squished together, no spaces or carriage returns.
I appreciate any advice you can offer.
Thanks!</description>
		<content:encoded><![CDATA[<p>I had this concept for a project but figured I&#8217;d search before reinventing the wheel. Good work, elegant solution!  <img src='http://www.prolificnotion.co.uk/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
A couple of things&#8230;<br />
I tried your solution but I&#8217;m getting an exception:<br />
Reference to undeclared entity &#8216;nbsp&#8217;.<br />
at line:<br />
xmlDoc.LoadXml(HTML);</p>
<p>Also, any UL, OL, or TABLE don&#8217;t get any formatting. It&#8217;s all one line of text squished together, no spaces or carriage returns.<br />
I appreciate any advice you can offer.<br />
Thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-2195</link>
		<dc:creator>Jim</dc:creator>
		<pubDate>Tue, 15 Dec 2009 17:08:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-2195</guid>
		<description>No problem! I will check back every so often to see if you were able to find a solution. For now, I&#039;m just keeping the Title text in my output.

I still think your solution is superior so my request is just a refinement to an already superb approach.

Regards,
Jim</description>
		<content:encoded><![CDATA[<p>No problem! I will check back every so often to see if you were able to find a solution. For now, I&#8217;m just keeping the Title text in my output.</p>
<p>I still think your solution is superior so my request is just a refinement to an already superb approach.</p>
<p>Regards,<br />
Jim</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon Dingley</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-2160</link>
		<dc:creator>Simon Dingley</dc:creator>
		<pubDate>Mon, 07 Dec 2009 12:16:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-2160</guid>
		<description>Jim, sorry for the delayed reply. Unfortunately I have been unsuccessful also in my attempts to achieve what you are after. I will continue to try when I get a chance and will post back on my success(or failure).</description>
		<content:encoded><![CDATA[<p>Jim, sorry for the delayed reply. Unfortunately I have been unsuccessful also in my attempts to achieve what you are after. I will continue to try when I get a chance and will post back on my success(or failure).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JimHou</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-2120</link>
		<dc:creator>JimHou</dc:creator>
		<pubDate>Sat, 28 Nov 2009 02:05:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-2120</guid>
		<description>Hi Simon,

I tried your suggestion (replacing &quot;*&quot; with &quot;body&quot; where indicated) but had no luck (the Title tag value is still appearing in the resulting text string). Although I was very careful to make exactly the change you suggested (down to the case, of course, of the word &quot;body&quot;), it is entirely possible that the problem is somehow with my implementation...

If you can think of any other ideas, I would be happy to try them. Meanwhile, I am truncating the start of the string as needed (works, but not as elegant).

Regards,
Jim</description>
		<content:encoded><![CDATA[<p>Hi Simon,</p>
<p>I tried your suggestion (replacing &#8220;*&#8221; with &#8220;body&#8221; where indicated) but had no luck (the Title tag value is still appearing in the resulting text string). Although I was very careful to make exactly the change you suggested (down to the case, of course, of the word &#8220;body&#8221;), it is entirely possible that the problem is somehow with my implementation&#8230;</p>
<p>If you can think of any other ideas, I would be happy to try them. Meanwhile, I am truncating the start of the string as needed (works, but not as elegant).</p>
<p>Regards,<br />
Jim</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JimHou</title>
		<link>http://www.prolificnotion.co.uk/convert-html-to-plain-text-in-c-using-markdown/comment-page-1/#comment-2118</link>
		<dc:creator>JimHou</dc:creator>
		<pubDate>Fri, 27 Nov 2009 17:42:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.prolificnotion.co.uk/?p=262#comment-2118</guid>
		<description>Thanks again. I will try this (the suggestion makes sense--restricting to the body tag). As before I will let you know if I encounter any problems.

Regards,
Jim</description>
		<content:encoded><![CDATA[<p>Thanks again. I will try this (the suggestion makes sense&#8211;restricting to the body tag). As before I will let you know if I encounter any problems.</p>
<p>Regards,<br />
Jim</p>
]]></content:encoded>
	</item>
</channel>
</rss>

