Pythonbot / readme.md
Bloodlyghoul's picture
Rename README.md to readme.md
4879bd9 verified
|
raw
history blame
1.94 kB
![web scraping with python](https://github.com/rajat4665/web-scraping-with-python/blob/master/WEB%20SCRAPING.jpg)
<br>
<span style="text-decoration: underline;"><strong>Introduction:</strong></span>
<b>Web scraping</b>, <b>web harvesting</b>, or <b>web data extraction</b> is data scraping used for extracting data from websites using its HTML structure, In this post, I will explain basic fundaments of web scraping using python and also explore it by a live demonstration with two python libraries Beautifulsoup and requests respectively.
<span style="text-decoration: underline;"><strong>What you will learn from this post:</strong></span>
<ul>
<li>basic understanding of web scraping</li>
<li>how to extract data from a website using classes and HTML tags</li>
<li>how to use requests module to get data</li>
<li>how to use Beautifulsoup</li>
</ul>
<span style="text-decoration: underline;"><strong>Requirements:</strong></span>
<ul>
<li>python3</li>
<li>requests</li>
<li>bs4</li>
</ul>
<h3>Install required dependencies :</h3>
<ul>
<li>clone or download it from <a href="https://github.com/rajat4665/web-scraping-with-python" target="_blank" rel="noopener">here</a></li>
<li>install requirements.txt file</li>
<li><code>pip install -r requirements.txt</code></li>
</ul>
<h2> How to run this code</h2>
<ul>
<li>there are two source code files, one is .py extention and another is .ipynb extention</li>
<li>one can run Scraping with BeautifulSoup.py file in python by run this cammand in terminal "python3 Web Scraping with BeautifulSoup.py"</li>
<li>one can run Scraping with BeautifulSoup.ipynb file in jupyter notebook /li>
<li>one can install juypyter notebook by this command "pip3 install jupyter"</li>
<li> CLI scraping tool is underdevelopment only beta version is available now </li>
</ul>
----------------------------------------------------------------------------------------
<h1>HAPPY CODING</h1>