Search

Reinhart Previano K.

Do you love to Ctrl-K, Ctrl-/, or / ? Now you can do three of them (>_ )!

No results so far...

Contact Information

Q&A: How do you get plain text from HTML?

Reinhart Previano Koentjoro's profile picture

Reinhart Previano Koentjoro (@reinhart)

Published on Q&A

  1. You can view the plain text from any of HTML tags: <p>, <h1>, <h2>, <h3> <h4>, <h5>, <h6>, <td> (inside a <tr> in a <table>), <th> (inside a <tr> in a <table>) and all text inside a tag are texts.
  2. Some of the text may disappear from the webpage. This is done by CSS, JS and other scripts.
  3. Some of the text may changed by a script (such as JavaScript). You can see the final results in the web inspector.
  4. Some of the texts are images. This cannot be converted unless the image is converted by some kind of OCRs and Online OCRs.
  5. Some of the texts may be hidden due to another object blocking it. For example, a text is located inside the <div> tag but blocked by other elements inside the tag. Another example is when the text inside the HTML5 <video> tag is hidden because the tag shows content in a video format.
  6. Some images have hidden text, too. This can be located by the attribute alt="..." inside the <img> tag.
  7. Texts that are inside of an applet/object that requires plugins (such as Java, Flash and Silverlight) may not be copied completely as text. Meanwhile, you can still obtain the plain text from HTML via Reading View feature that is available on some browsers such as Android Stock browser, Firefox and Safari. There may be extensions available for Chrome and other browser's users.
Share Copy Link Print PDF Embed Share to Email Share to SMS Yahoo! Share to Yahoo! Mail Mastodon Share to Mastodon Share to KakaoStory Messenger Share to Messenger Pocket Share to Pocket Flipboard Share to Flipboard Pinterest Share to Pinterest Reddit Share to Reddit Y Combinator Share to Hacker News Odnoklassniki Share to Odnoklassniki Blogger Share to Blogger Pleroma Share to Pleroma Share to Friendica Share to KakaoTalk 1Artboard 1 copy 2 Share to Snapchat Xing Share to Xing Share to Misskey LINE Share to LINE Evernote Share to Evernote WhatsApp Share to WhatsApp LiveJournal Share to Livejournal Diaspora Share to Diaspora Share to Gmail Threads Share to Threads Threema Share to Threema Share to X Tumblr Share to Tumblr Buffer Share to Buffer LinkedIn Share to LinkedIn Mail.Ru Share to mail.ru VK Share to VKontakte Trello Share to Trello Facebook Share to Facebook Bluesky Share to Bluesky Skype Share to Skype Hatena Bookmark Share to Hatena Bookmark! Share via MastodonShare Telegram Share to Telegram WordPress Share to WordPress.com

Embed

This website supports oEmbed. To quickly use oEmbed, just copy this site's link to your oEmbed-supported apps and websites like WordPress.

Alternatively, copy and paste the HTML code below to embed this post in your website.

($_ )! We have made this thing responsive, but recommend at least 512x512 pixels for best results.
<iframe src="https://reinhart1010.id/blog/2016/06/12/how-do-you-get-plain-text-from-html?embed" height="512" width="512" style="border:none;"><a href="{{ $canonical }}">https://reinhart1010.id/blog/2016/06/12/how-do-you-get-plain-text-from-html</a></iframe>
Preview

Reinhart Previano Koentjoro
Reinhart Previano Koentjoro
Citra Manggala Dirgantara
Citra Manggala Dirgantara

A Reinhart company

Products

Company