{"id":22098,"date":"2020-01-27T21:49:37","date_gmt":"2020-01-27T21:49:37","guid":{"rendered":"http:\/\/pre-production.taftcollege.edu\/high-tech-center\/?page_id=22098"},"modified":"2020-03-16T15:25:54","modified_gmt":"2020-03-16T15:25:54","slug":"best-practices","status":"publish","type":"page","link":"https:\/\/archive.taftcollege.edu\/high-tech-center\/tc-convert\/best-practices\/","title":{"rendered":"Best Practices"},"content":{"rendered":"<section class=\"wpb-content-wrapper\"><p>[vc_row css_animation=&#8221;&#8221; row_type=&#8221;row&#8221; use_row_as_full_screen_section=&#8221;no&#8221; type=&#8221;full_width&#8221; text_align=&#8221;left&#8221;][vc_column]<div id=\"ultimate-heading-744069e5fdb7a8c13\" class=\"uvc-heading ult-adjust-bottom-margin ultimate-heading-744069e5fdb7a8c13 uvc-7317 \" data-hspacer=\"line_only\"  data-halign=\"left\" style=\"text-align:left\"><div class=\"uvc-main-heading ult-responsive\"  data-ultimate-target='.uvc-heading.ultimate-heading-744069e5fdb7a8c13 h2'  data-responsive-json-new='{\"font-size\":\"\",\"line-height\":\"\"}' ><h2 style=\"font-weight:normal;\">Conversion Best Practices <\/h2><\/div><div class=\"uvc-heading-spacer line_only\" style=\"topheight:1px;\"><span class=\"uvc-headings-line\" style=\"border-style:solid;border-bottom-width:1px;border-color:#ccc;width:300px;\"><\/span><\/div><div class=\"uvc-sub-heading ult-responsive\"  data-ultimate-target='.uvc-heading.ultimate-heading-744069e5fdb7a8c13 .uvc-sub-heading '  data-responsive-json-new='{\"font-size\":\"\",\"line-height\":\"\"}'  style=\"font-weight:normal;\"><\/p>\n<p style=\"text-align: left;\">Provided courtesy of Sean Keegan. The material may be used in connection with SensusAccess solutions<br \/>\nprovided Sean Keegan is credited.<\/p>\n<p>The quality of a conversion is dependent upon the quality of the original document. Additionally, the<br \/>\nresulting output format may include enhancements for navigation if the original file contains the<br \/>\nappropriate semantic markup. For instance, a MS Word document containing the heading style markup<br \/>\nfor chapters (e.g., Heading 1, Heading 2, etc.) will convert into a more usable DAISY or EPUB format with<br \/>\nthe relevant chapter navigation elements. The following best practices identify simple methods to<br \/>\nprepare the file before converting in order to achieve a high-quality output.<\/p>\n<p><\/div><\/div>[\/vc_column][\/vc_row][vc_row css_animation=&#8221;&#8221; row_type=&#8221;row&#8221; use_row_as_full_screen_section=&#8221;no&#8221; type=&#8221;full_width&#8221; text_align=&#8221;left&#8221;][vc_column][vc_accordion active_tab=&#8221;1&#8243; style=&#8221;boxed_toggle&#8221;][vc_accordion_tab title=&#8221;PDF &amp; Image-based Files&#8221; el_id=&#8221;image-based-files&#8221;][vc_column_text]<\/p>\n<ul>\n<li>PDF and image-based files will be processed using optical character recognition (OCR) to create a text-based version of the<\/li>\n<li>If scanning the document, ensure the scanned image is free from smudges, dark marks, highlighted text, or artifacts in the image. These will affect the accuracy of the OCR<\/li>\n<li>Minimize the any effects from skewing. If the image is presented at an &#8220;off-angle&#8221;, the accuracy of the OCR process will be lower resulting in a lower quality text<\/li>\n<li>If you are starting with an image-based format and wish to convert to a text format, you may achieve better results by initially converting to Tagged PDF and then copying\/pasting the text into a MS Word document. While you can convert directly from an image file to a text file with SensusAccess, you may find better results for some image documents if converting to Tagged PDF and then to a text file (see &#8220;Converting to MS Word and Text Files&#8221; section).<\/li>\n<\/ul>\n<p>[\/vc_column_text][\/vc_accordion_tab][vc_accordion_tab title=&#8221;Converting to MS Word and Text Files&#8221; el_id=&#8221;#word-docs&#8221;][vc_column_text]SensusAccess will convert image-based documents into MS Word, RTF, and text files. You may also find it useful with some image-based documents to convert initially to Tagged PDF and then copy and paste the text from the Tagged PDF into MS Word. This may result in a better reading experience and may remove non-essential content.<\/p>\n<p>With the MS Word version of the document, you can more accurately &#8220;clean&#8221; the content for conversion into MP3 audio or for use with assistive technologies. Most conversions will take just a few seconds within MS Word and involve the use of the Find and Replace tools. For more information on using the find and Replace tools, see <a href=\"http:\/\/office.microsoft.com\/en-us\/word-help\/find-and-replace-text-or-other-items-HA001230392.aspx#_Toc282602052\">Using the Find and Replace in MS Word <\/a>removing special characters in a document.<\/p>\n<p><em>Please note &#8211; in the Find and Replace examples below, replace the &lt;space&gt; value with one spacebar and do not include the quotes.<\/em>[\/vc_column_text][\/vc_accordion_tab][vc_accordion_tab title=&#8221;Image-File to Tagged PDF to MS Word Document&#8221; el_id=&#8221;image-based-to-tagged-pdf&#8221;][vc_column_text]<\/p>\n<ul>\n<li>Submit the image-based document to SensusAccess and select Tagged PDF as the output<\/li>\n<li>Open the Tagged PDF and select all the text. Copy and paste this into a MS Word document (Open Office may also be used).<\/li>\n<li>Using Find and Replace:\n<ul>\n<li>Search for &#8220;.&lt;space&gt;^p&#8221; and replace with &#8220;.^p^p&#8221; .<\/li>\n<li>Search for &#8220;&lt;space&gt;^p&#8221; and replace with &#8220;&lt;space&gt;&#8221; .<\/li>\n<li>Search for &#8220;&lt;space&gt;\u2022&lt;space&gt;&#8221; and replace with &#8220;^p\u2022&lt;space&gt;&#8221; .<\/li>\n<li>Search for &#8220;-&lt;space&gt;&#8221; and replace with no<\/li>\n<\/ul>\n<\/li>\n<li>Save the document in your preferred text format.<\/li>\n<\/ul>\n<p>[\/vc_column_text][\/vc_accordion_tab][vc_accordion_tab title=&#8221;Image-File to MS Word Document&#8221; el_id=&#8221;image-to-word&#8221;][vc_column_text]To clean-up a MS Word file for use with assistive technology or for creating MP3 files, perform a &#8220;search and replace&#8221; to remove optional hyphens and section breaks. Identify the special character you wish to find in the &#8220;Find:&#8221; box and leave the &#8220;Replace with:&#8221; box empty. See <a href=\"http:\/\/office.microsoft.com\/en-us\/word-help\/find-and-replace-text-or-other-items-HA001230392.aspx#_Toc282602052\">Using the Find and Replace in MS<\/a> <a href=\"http:\/\/office.microsoft.com\/en-us\/word-help\/find-and-replace-text-or-other-items-HA001230392.aspx#_Toc282602052\">Word <\/a>for additional information on removing special characters in a document.<\/p>\n<ul>\n<li>Submit the image-based document to SensusAccess and select Microsoft Word as the output option.<\/li>\n<li>Open the converted Microsoft Word document (Open Office may also be used).<\/li>\n<li>Using Find and Replace:\n<ul>\n<li>Search for &#8220;Optional Hyphen&#8221; under Special Formatting and replace with no<\/li>\n<li>Search for &#8220;Section Breaks&#8221; under Special Formatting and replace with &#8220;^p^p&#8221;.<\/li>\n<li>Search for &#8220;Manual Page Breaks&#8221; and replace with &#8220;^p^p&#8221;.<\/li>\n<\/ul>\n<\/li>\n<li>Save the document in your preferred text format.<\/li>\n<\/ul>\n<p>[\/vc_column_text][\/vc_accordion_tab][vc_accordion_tab title=&#8221;Authoring MS Word, RTF, Text Files&#8221; el_id=&#8221;authoring-word-files&#8221;][vc_column_text]<\/p>\n<ul>\n<li>Use Word styles to specify document headings. For example, the style &#8220;Heading 1&#8221; could be used to identify the title of the document and the style &#8220;Heading 2&#8221; could be used to identify chapter information. It is best to use only one &#8220;Heading 1&#8221; to facilitate accurate conversions into other document formats (e.g., DAISY, EPUB, Braille, ).<\/li>\n<li>Provide short descriptions for content-related images in your MS Word document.<\/li>\n<li>Avoid using text-boxes in your document. If you want to customize the layout, use a Column Tool or a Section<\/li>\n<li>If converting to DAISY, page numbers will be identified based on the MS Word pagination. To obtain custom pagination, use the PageNumber style from the <a href=\"http:\/\/www.daisy.org\/project\/save-as-daisy-microsoft-word-add-in\">Save As DAISY plug-in for<\/a><a href=\"http:\/\/www.daisy.org\/project\/save-as-daisy-microsoft-word-add-in\"> Microsoft Office <\/a>for your custom page numbers.<\/li>\n<\/ul>\n<p>[\/vc_column_text][\/vc_accordion_tab][vc_accordion_tab title=&#8221;Authoring HTML Files&#8221; el_id=&#8221;authoring-html-files&#8221;][vc_column_text]<\/p>\n<ul>\n<li>Use HTML heading markup (e.g., &lt;h1&gt;, &lt;h2&gt;, etc.) to designate headings in the document. For example, the style &#8220;Heading 1&#8221; could be used to identify the title of the document and the style &#8220;Heading 2&#8221; could be used to identify chapter<\/li>\n<li>Provide short descriptions for content-related images in the HTML document.<\/li>\n<\/ul>\n<p>[\/vc_column_text][\/vc_accordion_tab][\/vc_accordion][\/vc_column][\/vc_row]<\/p>\n<\/section>","protected":false},"excerpt":{"rendered":"<p>[vc_row css_animation=&#8221;&#8221; row_type=&#8221;row&#8221; use_row_as_full_screen_section=&#8221;no&#8221; type=&#8221;full_width&#8221; text_align=&#8221;left&#8221;][vc_column]<div id=\"ultimate-heading-644869e5fdb7a91bb\" class=\"uvc-heading ult-adjust-bottom-margin ultimate-heading-644869e5fdb7a91bb uvc-2516 \" data-hspacer=\"line_only\"  data-halign=\"left\" style=\"text-align:left\"><div class=\"uvc-main-heading ult-responsive\"  data-ultimate-target='.uvc-heading.ultimate-heading-644869e5fdb7a91bb h2'  data-responsive-json-new='{\"font-size\":\"\",\"line-height\":\"\"}' ><h2 style=\"font-weight:normal;\">Conversion Best Practices <\/h2><\/div><div class=\"uvc-heading-spacer line_only\" style=\"topheight:1px;\"><span class=\"uvc-headings-line\" style=\"border-style:solid;border-bottom-width:1px;border-color:#ccc;width:300px;\"><\/span><\/div><\/div>  Provided courtesy of Sean Keegan. The material may be used in connection with SensusAccess solutions  provided Sean Keegan is credited.  The quality of a conversion is dependent upon the quality of the original document. Additionally, the<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":22093,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_relevanssi_hide_post":"on","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","inline_featured_image":false,"footnotes":""},"class_list":["post-22098","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/pages\/22098","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/comments?post=22098"}],"version-history":[{"count":3,"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/pages\/22098\/revisions"}],"predecessor-version":[{"id":22111,"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/pages\/22098\/revisions\/22111"}],"up":[{"embeddable":true,"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/pages\/22093"}],"wp:attachment":[{"href":"https:\/\/archive.taftcollege.edu\/high-tech-center\/wp-json\/wp\/v2\/media?parent=22098"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}