How to extract underlined text from pdf Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 23:30 UTC (7:30 pm US/Eastern) Data science time! April 2019 and salary with experience The Ask Question Wizard is Live!How to merge two dictionaries in a single expression?How do I check if a list is empty?How do I check whether a file exists without exceptions?How can I safely create a nested directory in Python?How do I parse a string to a float or int in Python?Extracting extension from filename in PythonHow do I sort a dictionary by value?How do I list all files of a directory?How do you parse and process HTML/XML in PHP?Python - Extract formatted text (i.e. bold, italics, color) from pdf

Are there existing rules/lore for MTG planeswalkers?

What is the ongoing value of the Kanban board to the developers as opposed to management

All ASCII characters with a given bit count

How to translate "red flag" into Spanish?

Will I lose my paid in full property

Are these square matrices always diagonalisable?

Is it OK if I do not take the receipt in Germany?

Is it accepted to use working hours to read general interest books?

Is there a verb for listening stealthily?

What is a 'Key' in computer science?

Processing ADC conversion result: DMA vs Processor Registers

Why did Europeans not widely domesticate foxes?

How did Elite on the NES work?

What does the black goddess statue do and what is it?

How was Lagrange appointed professor of mathematics so early?

TV series episode where humans nuke aliens before decrypting their message that states they come in peace

How can I wire a 9-position switch so that each position turns on one more LED than the one before?

How long can a nation maintain a technological edge over the rest of the world?

Variable does not exist: sObjectType (Task.sObjectType)

Could a cockatrice have parasitic embryos?

Was there ever a LEGO store in Miami International Airport?

What is the purpose of the side handle on a hand ("eggbeater") drill?

Simulate round-robin tournament draw

Writing a T-SQL stored procedure to receive 4 numbers and insert them into a table



How to extract underlined text from pdf



Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 23, 2019 at 23:30 UTC (7:30 pm US/Eastern)
Data science time! April 2019 and salary with experience
The Ask Question Wizard is Live!How to merge two dictionaries in a single expression?How do I check if a list is empty?How do I check whether a file exists without exceptions?How can I safely create a nested directory in Python?How do I parse a string to a float or int in Python?Extracting extension from filename in PythonHow do I sort a dictionary by value?How do I list all files of a directory?How do you parse and process HTML/XML in PHP?Python - Extract formatted text (i.e. bold, italics, color) from pdf



.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;








-1















I tried pdfminer, pdfquery and other libraries and I get bold, italic, fonts, etc, but cannot get underlined text. For example when converting with pdfminer to html it creates DIVs with borders but not associated with the word.
Any idea of how can identify underlined text in a PDF? if possible using python.
Thank you!










share|improve this question






















  • just asking, why the -1? what other information can I add/is required? Thank you!

    – Alejandro
    Mar 28 at 12:51

















-1















I tried pdfminer, pdfquery and other libraries and I get bold, italic, fonts, etc, but cannot get underlined text. For example when converting with pdfminer to html it creates DIVs with borders but not associated with the word.
Any idea of how can identify underlined text in a PDF? if possible using python.
Thank you!










share|improve this question






















  • just asking, why the -1? what other information can I add/is required? Thank you!

    – Alejandro
    Mar 28 at 12:51













-1












-1








-1








I tried pdfminer, pdfquery and other libraries and I get bold, italic, fonts, etc, but cannot get underlined text. For example when converting with pdfminer to html it creates DIVs with borders but not associated with the word.
Any idea of how can identify underlined text in a PDF? if possible using python.
Thank you!










share|improve this question














I tried pdfminer, pdfquery and other libraries and I get bold, italic, fonts, etc, but cannot get underlined text. For example when converting with pdfminer to html it creates DIVs with borders but not associated with the word.
Any idea of how can identify underlined text in a PDF? if possible using python.
Thank you!







python parsing pdfminer






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Mar 22 at 14:57









AlejandroAlejandro

77116




77116












  • just asking, why the -1? what other information can I add/is required? Thank you!

    – Alejandro
    Mar 28 at 12:51

















  • just asking, why the -1? what other information can I add/is required? Thank you!

    – Alejandro
    Mar 28 at 12:51
















just asking, why the -1? what other information can I add/is required? Thank you!

– Alejandro
Mar 28 at 12:51





just asking, why the -1? what other information can I add/is required? Thank you!

– Alejandro
Mar 28 at 12:51












0






active

oldest

votes












Your Answer






StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55302394%2fhow-to-extract-underlined-text-from-pdf%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55302394%2fhow-to-extract-underlined-text-from-pdf%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Kamusi Yaliyomo Aina za kamusi | Muundo wa kamusi | Faida za kamusi | Dhima ya picha katika kamusi | Marejeo | Tazama pia | Viungo vya nje | UrambazajiKuhusu kamusiGo-SwahiliWiki-KamusiKamusi ya Kiswahili na Kiingerezakuihariri na kuongeza habari

SQL error code 1064 with creating Laravel foreign keysForeign key constraints: When to use ON UPDATE and ON DELETEDropping column with foreign key Laravel error: General error: 1025 Error on renameLaravel SQL Can't create tableLaravel Migration foreign key errorLaravel php artisan migrate:refresh giving a syntax errorSQLSTATE[42S01]: Base table or view already exists or Base table or view already exists: 1050 Tableerror in migrating laravel file to xampp serverSyntax error or access violation: 1064:syntax to use near 'unsigned not null, modelName varchar(191) not null, title varchar(191) not nLaravel cannot create new table field in mysqlLaravel 5.7:Last migration creates table but is not registered in the migration table

은진 송씨 목차 역사 본관 분파 인물 조선 왕실과의 인척 관계 집성촌 항렬자 인구 같이 보기 각주 둘러보기 메뉴은진 송씨세종실록 149권, 지리지 충청도 공주목 은진현