Crawler-WebImage

Google과 Naver의 검색 이미지들을 한번에 원하는 수 만큼 다운로드해주는 크롤링 프로젝트

ARMY 11개월 기념 파이썬 프로젝트

기간: 2020.12.01 ~ 2020.12.13
참고 기술블로그

Example

For Static Site

For Dynamic Site (Recommendation)

Dependency Module - 사용 전에 꼭 ! 😆

정적 페이지를 위한 크롤링 파일은 Beautiful Soup, urllib를, 동적 페이지를 위한 크롤링 디렉토리는 Selenium, Requests를 주로 이용함

When Use on Static Site(crawlingGoogle.py, crawlingNaver.py)

pip install urllib
pip install bs4

When Use on Dynamic Site(Crawler-GoogleImg, Crawler-NaverImg)

pip install selenium
pip install urllib
pip install Pillow

install Chrome
install ChromeDriver

ChromeDriver Download

https://chromedriver.chromium.org/

ChromeDriver Download on groomIDE

https://help.goorm.io/en/goormide/18.faq/language-and-environment/selenium-chromewebdriver#check-the-chrome-version

How to use

정적 사이트에서의 크롤링: Crawler-WebImage/Crawler_StaticSiteImg/crawlingGoogle.py, crawlingNaver.py

0. images 디렉토리를 생성
1. crawlingGoogle.py 또는 crawlingNaver.py의 keyword에 검색어를 입력
2. 터미널에 python crawlingGoogle.py 또는 crawlingNaver.py를 입력
3. 이미지 다운로드 확인

Example

keyword = "your keyword"

동적 사이트에서의 크롤링: Crawler-WebImage/main.py

0. images 디렉토리를 생성
1. main.py 파일 열기
2. 검색할 키워드, 다운로드 받을 이미지 수, 크롤링 할 사이트를 입력
3. 터미널에 python main.py 입력

Example

# 검색할 키워드 입력
name_list = ["서울"]

# 총 다운로드 받을 이미지 수 - 100을 수정
max_image = 200

# 크롤링 할 사이트 입력 - 1. Google 2. Naver 3. Both
crawler_site = "Both"

Architecture

main > search_and_download > get_image_links > download_image

Functions

from search_and_download import search_and_download
from get_image_links import fetch_image_urls
from download_image import persist_image

Name	Description
`main`	검색할 키워드와 이미지 수를 입력
`search_and_download`	이미지 검색, 저장 함수를 호출하는 함수
`fetch_image_urls`	이미지의 url을 가져오는 함수
`persist_image`	이미지를 정해진 경로에 저장하는 함수

Developer

박길현

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
Crawler_GoogleImg		Crawler_GoogleImg
Crawler_NaverImg		Crawler_NaverImg
Crawler_StaticSiteImg		Crawler_StaticSiteImg
README.md		README.md
goorm.manifest		goorm.manifest
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Crawler-WebImage

Example

For Static Site

For Dynamic Site (Recommendation)

Dependency Module - 사용 전에 꼭 ! 😆

ChromeDriver Download

ChromeDriver Download on groomIDE

How to use

Example

Example

Architecture

Functions

Developer

About

Uh oh!

Releases

Packages

Languages

ureChanger/Crawler-WebImage

Folders and files

Latest commit

History

Repository files navigation

Crawler-WebImage

Example

For Static Site

For Dynamic Site (Recommendation)

Dependency Module - 사용 전에 꼭 ! 😆

ChromeDriver Download

ChromeDriver Download on groomIDE

How to use

Example

Example

Architecture

Functions

Developer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages