Using regex in python to obtain multiple repeating lines
I'm very new to RegEx and have a very large text file, a small portion of which is shown below:
<div class="hbk-preamble " id="preamble-APG5180">
<div class="hbk-preamble-entry">
<div class="hbk-preamble-icon hbk-preamble-icon_mode"></div>
<p class="hbk-preamble-heading">Offered</p>
<p><a href="index-bylocation-city-melbourne.html">City (Melbourne)</a></p><ul class="hbk-preamble-list__offerings"><li>Summer semester A 2019 (Flexible)</li></ul><p><a href="index-bylocation-clayton.html">Clayton</a></p><ul class="hbk-preamble-list__offerings"><li>First semester 2019 (On-campus)</li></ul>
</div>
</div>
<div class="notes">
<p class="hbk-heading hdg_6">Notes</p>
<p></p><ul>
<li>The unit may be offered as part of the <a class="hbk-screen-url" href="http://www.monash.edu/students/courses/arts/summer-program.html">Summer Arts Program</a><span class="hbk-print-url">Summer Arts Program (<a href="http://www.monash.edu/students/courses/arts/summer-program.html">http://www.monash.edu/students/courses/arts/summer-program.html</a>)</span>.</li>
<li>For more information please visit the <a class="hbk-screen-url" href="https://www.anzsog.edu.au/">ANZSOG webpage</a><span class="hbk-print-url">ANZSOG webpage (<a href="https://www.anzsog.edu.au/">https://www.anzsog.edu.au/</a>)</span>.</li>
</ul>
</div>
<h2 class="hbk-heading">Synopsis</h2>
<div>
<p>The media is one of the most important components of any political society. In a liberal democracy like Australia, its role and function have profound implications for the conduct of politics, the nature of democracy and public policy outcomes. In this unit, the relationship between the media, politics and public policy is studied from three broad perspectives. First, the politics of the media is investigated from the perspective of liberal democratic theory in order to understand the role of news media on the policy debate. Second, the political economy of the media is investigated. Particular emphasis is on the structure and operation of media organisations and journalists and how political news is covered. Third, the unit undertakes a study of the relationship between the media and political actors. Particular emphasis is on the use of public relations and 'spin doctors' in managing the media as well as the utilisation of political advertising and strategic political communication by governments and political agents.</p>
</div>
<h2 class="hbk-heading">Outcomes</h2>
<div>
<p>Upon successful completion of the unit students should have:</p>
<ol princestart="0" start="1" type="1">
I would like to use RegEx to get only the 'Synopsis' text out of it:
The media is one of the most important components of any political society. In a liberal democracy like Australia, its role and function have profound implications for the conduct of politics, the nature of democracy and public policy outcomes. In this unit, the relationship between the media, politics and public policy is studied from three broad perspectives. First, the politics of the media is investigated from the perspective of liberal democratic theory in order to understand the role of news media on the policy debate. Second, the political economy of the media is investigated. Particular emphasis is on the structure and operation of media organisations and journalists and how political news is covered. Third, the unit undertakes a study of the relationship between the media and political actors. Particular emphasis is on the use of public relations and 'spin doctors' in managing the media as well as the utilisation of political advertising and strategic political communication by governments and political agents.
I need to extract the synopsis text for every section in the text file. What should I do?
So far, I've read the text file using read() and readlines(), but I can't work out a pattern to get started.
python regex
asked Mar 24 at 7:12 by NDTB
3
Please look into using an XML/HTML parser in Python, which I believe natively supports them. Using regex to parse HTML is generally evil. Many kittens will be dying if you continue doing this.
– Tim Biegeleisen
Mar 24 at 7:19
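As a minimal sketch of the parser route suggested in the comment above, the standard library's html.parser.HTMLParser can be subclassed to pick up the text that follows each Synopsis heading. The class name SynopsisParser and its attributes are made up for illustration, and the sketch assumes every Synopsis heading is an <h2> followed by a single <div>, as in the sample from the question.

```python
from html.parser import HTMLParser


class SynopsisParser(HTMLParser):
    """Collect the text of the <div> that follows each <h2>Synopsis</h2>."""

    def __init__(self):
        super().__init__()
        self.synopses = []    # one string per Synopsis section found
        self._in_h2 = False   # currently inside an <h2> element
        self._h2_text = []    # text pieces of the current <h2>
        self._armed = False   # last <h2> read "Synopsis"; waiting for its <div>
        self._depth = 0       # <div> nesting depth while capturing
        self._buffer = []     # text pieces of the section being captured

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self._in_h2 = True
            self._h2_text = []
        elif tag == "div":
            if self._depth:
                self._depth += 1      # nested <div> inside the captured block
            elif self._armed:
                self._armed = False   # start capturing this <div>
                self._depth = 1
                self._buffer = []

    def handle_endtag(self, tag):
        if tag == "h2" and self._in_h2:
            self._in_h2 = False
            self._armed = "".join(self._h2_text).strip() == "Synopsis"
        elif tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Collapse runs of whitespace and store the finished block
                self.synopses.append(" ".join("".join(self._buffer).split()))

    def handle_data(self, data):
        if self._in_h2:
            self._h2_text.append(data)
        elif self._depth:
            self._buffer.append(data)


# Shortened stand-in for the file content shown in the question
sample = """
<h2 class="hbk-heading">Synopsis</h2>
<div>
<p>The media is one of the most important components of any political society.</p>
</div>
<h2 class="hbk-heading">Outcomes</h2>
<div>
<p>Upon successful completion of the unit students should have:</p>
</div>
"""

parser = SynopsisParser()
parser.feed(sample)
parser.close()
print(parser.synopses)
```

For a real file, the string passed to feed() would be the file's content instead of the shortened sample.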
2 Answers
I will start by not answering your question directly, because I assume it is an XY problem. You are dealing with HTML, and there are plenty of powerful tools made for that.
Take a look at BeautifulSoup for Python:
from bs4 import BeautifulSoup

# content holds the HTML text, e.g. read from your file
soup = BeautifulSoup(content, 'html.parser')
From this soup you can then extract whatever you need.
If you still want to use regular expressions, https://regex101.com can help you build and test a pattern:
Demo: https://regex101.com/r/AcozoW/1
<p.*?Notes.*?<li>(.+?)</li>
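The pattern above is written around the Notes list. A minimal sketch of the same idea aimed at the Synopsis block might look like the following. It is an illustration under assumptions, not part of the original demo: the name SYNOPSIS_RE is hypothetical, and the pattern assumes each Synopsis <h2> is followed by one <div> with no nested <div> inside it, as in the sample from the question.

```python
import re

# Assumed layout: <h2 ...>Synopsis</h2>, then a <div> whose closing </div>
# is the first one after it (no nested <div> inside the synopsis).
SYNOPSIS_RE = re.compile(
    r"<h2[^>]*>\s*Synopsis\s*</h2>\s*<div[^>]*>(.*?)</div>",
    re.DOTALL,  # let '.' match newlines, since the block spans several lines
)

# Shortened stand-in for the file content shown in the question
sample = """<h2 class="hbk-heading">Synopsis</h2>
<div>
<p>The media is one of the most important components of any political society.</p>
</div>
<h2 class="hbk-heading">Outcomes</h2>
<div>
<p>Upon successful completion of the unit students should have:</p>
</div>
"""

# findall returns the captured group for every non-overlapping match
synopses = []
for body in SYNOPSIS_RE.findall(sample):
    text = re.sub(r"<[^>]+>", "", body).strip()  # drop inner tags such as <p>
    synopses.append(text)

print(synopses)
```

The lazy quantifier (.*?) stops at the first </div>, which is what keeps one match from running into the next section.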
answered Mar 24 at 7:57 by nowox
I would recommend the BeautifulSoup package for this. You could try something like this:
import requests
from bs4 import BeautifulSoup

data = requests.get('put website address here')
soup = BeautifulSoup(data.text, 'html.parser')

# Print the text of every <h2 class="hbk-heading"> element
for i in soup.find_all('h2', {'class': 'hbk-heading'}):
    print(i.text.strip())
answered Mar 24 at 7:59 by AnnaB
I tried this, but I have a .txt file, not a link. I am required to use RegEx for this.
– NDTB
Mar 24 at 8:14
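For the situation described in this comment (a local text file and a regex requirement), one possible sketch is shown below. The helper extract_synopses, the names SYNOPSIS_RE and TAG_RE, and the file name sample.txt are all hypothetical. The sketch assumes the file is UTF-8, fits in memory, and has the same <h2>Synopsis</h2> followed by <div> layout as the sample in the question.

```python
import html
import re
import tempfile
from pathlib import Path

# Capture everything between the <div> after a Synopsis heading and its </div>
SYNOPSIS_RE = re.compile(
    r"<h2[^>]*>\s*Synopsis\s*</h2>\s*<div[^>]*>(.*?)</div>",
    re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")


def extract_synopses(path, encoding="utf-8"):
    """Return the plain text of every Synopsis block in the file at `path`."""
    content = Path(path).read_text(encoding=encoding)  # whole file as one string
    results = []
    for body in SYNOPSIS_RE.findall(content):
        text = html.unescape(TAG_RE.sub("", body))  # drop tags, decode entities
        results.append(" ".join(text.split()))      # collapse whitespace
    return results


# Demo on a throwaway file; in practice pass the path to your own .txt file.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "sample.txt"  # hypothetical file name
    demo_path.write_text(
        '<h2 class="hbk-heading">Synopsis</h2>\n<div>\n<p>Media &amp; politics.</p>\n</div>\n'
        '<h2 class="hbk-heading">Outcomes</h2>\n<div>\n<p>Not wanted.</p>\n</div>\n',
        encoding="utf-8",
    )
    synopses = extract_synopses(demo_path)

print(synopses)
```

Reading the file with read_text (or read()) rather than readlines() keeps the content as one string, which lets a single pattern span the line breaks between the heading and the paragraph.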