![web scraping with python](https://github.com/rajat4665/web-scraping-with-python/blob/master/WEB%20SCRAPING.jpg)
<br>
<span style="text-decoration: underline;"><strong>Introduction:</strong></span>
<b>Web scraping</b>, also called <b>web harvesting</b> or <b>web data extraction</b>, is the extraction of data from websites by working with their HTML structure. In this post, I will explain the basic fundamentals of web scraping with Python and walk through a live demonstration using two Python libraries, BeautifulSoup and requests.
<span style="text-decoration: underline;"><strong>What you will learn from this post:</strong></span>
<ul>
<li>a basic understanding of web scraping</li>
<li>how to extract data from a website using classes and HTML tags</li>
<li>how to use the requests module to get data</li>
<li>how to use BeautifulSoup</li>
</ul>
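
The points above can be sketched in a few lines of Python. This is a minimal illustration, not the code shipped in the repository: the sample markup, the class names `title` and `item`, and the helper names `fetch_html` and `extract_items` are all assumptions made up for the example. It uses Python's built-in `html.parser`, so no extra parser needs to be installed.

```python
import requests
from bs4 import BeautifulSoup

# Hypothetical sample markup, so the example runs without network access.
SAMPLE_HTML = """
<html>
  <body>
    <h1 class="title">Sample page</h1>
    <ul>
      <li class="item">First</li>
      <li class="item">Second</li>
      <li class="other">Third</li>
    </ul>
    <a href="https://example.com/about">About</a>
  </body>
</html>
"""


def fetch_html(url, timeout=10):
    """Download a page with requests and return its HTML as text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.text


def extract_items(html):
    """Return the heading text, the text of each <li class="item">, and all link targets."""
    soup = BeautifulSoup(html, "html.parser")
    # Select a single element by tag name and class.
    title = soup.find("h1", class_="title").get_text(strip=True)
    # Select every matching element by tag name and class.
    items = [li.get_text(strip=True) for li in soup.find_all("li", class_="item")]
    # Select by tag name only, keeping anchors that carry an href attribute.
    links = [a["href"] for a in soup.find_all("a", href=True)]
    return title, items, links


if __name__ == "__main__":
    print(extract_items(SAMPLE_HTML))
```

To scrape a live page, pass the result of `fetch_html(url)` to `extract_items` and adjust the tag and class names to match that page's markup.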
<span style="text-decoration: underline;"><strong>Requirements:</strong></span>
<ul>
<li>Python 3</li>
<li>requests</li>
<li>bs4</li>
</ul>
<h3>Install required dependencies:</h3>
<ul>
<li>clone or download the repository from <a href="https://github.com/rajat4665/web-scraping-with-python" target="_blank" rel="noopener">here</a></li>
<li>install the packages listed in the requirements.txt file:</li>
<li><code>pip install -r requirements.txt</code></li>
</ul>
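
After installing, a quick way to confirm that the dependencies are importable is to print their versions. This is only a suggested check, assuming the packages were installed into the interpreter you are running; the helper name `dependency_versions` is made up for the example.

```python
import sys

import bs4
import requests


def dependency_versions():
    """Return the running Python version and the installed package versions."""
    return {
        "python": sys.version.split()[0],
        "requests": requests.__version__,
        "bs4": bs4.__version__,
    }


if __name__ == "__main__":
    for name, version in dependency_versions().items():
        print(f"{name}: {version}")
```

An `ImportError` here usually means the packages were installed for a different Python interpreter than the one running the script.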
<h2>How to run this code</h2>
<ul>
<li>there are two source code files, one with a .py extension and one with an .ipynb extension</li>
<li>run the .py file from a terminal with this command: <code>python3 "Web Scraping with BeautifulSoup.py"</code></li>
<li>open the Scraping with BeautifulSoup.ipynb file in Jupyter Notebook</li>
<li>install Jupyter Notebook with this command: <code>pip3 install jupyter</code></li>
<li>a CLI scraping tool is under development; only a beta version is available for now</li>
</ul>
----------------------------------------------------------------------------------------
<h1>HAPPY CODING</h1>