將Word轉成Markdown:word2markdown
這個工具能夠將 Word 轉成 Markdown,包含圖片和Math。 它由9個連續的步驟:
- Exporting to HTML using Microsoft Word 2012. We automated this on OS X using Automator. Solutions for other platforms are welcome!
- Extracting image types that we want to use. Keeps the original quality, unless that's a proprietary .emz file. In this step we also fix some math.
- Converting HTML to XML using tagsoup.
- Covert OOML (proprietary Word format) into MathML equations, using Microsoft's own conversion XSLT, and a custom version of this XSLT. Uses Saxon 8.
- Some intermediate fixes for whitespace and math.
- Conversion back into HTML using Tidy. Also strips a lot of stuff.
- More intermediate fixes to deal with shortcomings of Tidy and Pandoc.
- Conversion into Markdown using Pandoc.
- Lots of cleanup and final fixes to the Markdown.
環境要求
本文由用戶 jopen 自行上傳分享,僅供網友學習交流。所有權歸原作者,若您的權利被侵害,請聯系管理員。
轉載本站原創文章,請注明出處,并保留原始鏈接、圖片水印。
本站是一個以用戶分享為主的開源技術平臺,歡迎各類分享!