Course Introduction
Year / Term | Academic Year 113, Semester 1
Department | Department of Electrical Engineering, Master's Program (combined first- and second-year class)
Instructional Method | Classroom instruction, in Chinese
Course Reference Number | 182007
Course Title (Chinese) | 智慧型系統專論
Course Title (English) | Special Topics on Intelligent Systems
Credits / Hours | 3 / 3
Requirement / Elective Course | Elective
Instructor | 黃崇能
Teaching Assistant |
Meeting Time | Monday, periods 7, 8, 9
Classroom | ZB209
Office Hours |
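Awards and Grants |
UN Sustainable Development Goals (SDGs, cross-disciplinary categories) |
SDG 04 | Quality Education: ensure inclusive, equitable, and high-quality education, and promote lifelong learning
SDG 09 | Industry, Innovation and Infrastructure: build resilient infrastructure, promote inclusive and sustainable industry, and accelerate innovation
SDG 11 | Sustainable Cities and Communities: build inclusive, safe, resilient, and sustainable cities and rural communities
|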
Learning Objectives
|
Web crawling collects data from the web by discovering and visiting the URLs of one or more domains. Python has several popular web crawling libraries and frameworks.
In this course, we will first introduce different crawling strategies and use cases. Then we will build a simple web crawler from scratch in Python using two libraries: requests and Beautiful Soup. Next, we will see why a web crawling framework such as Scrapy is usually the better choice. Finally, we will build an example crawler with Scrapy to collect film metadata from IMDb and see how Scrapy scales to websites with several million pages.
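As a rough illustration of the "simple web crawler from scratch" idea, the sketch below runs a breadth-first crawl restricted to a single domain. It is not the course's implementation: to stay runnable offline it uses only the standard library (`html.parser` and `urllib.parse`) in place of requests and Beautiful Soup, and it takes the page downloader as a `fetch` argument. The names `LinkExtractor`, `crawl`, and `PAGES`, and the example.com URLs, are hypothetical and exist only for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, max_pages=100):
    """Breadth-first crawl limited to the domain of start_url.

    fetch is any callable mapping a URL to HTML text, or None on failure.
    Returns the URLs visited, in visit order.
    """
    domain = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop #fragments.
            absolute, _ = urldefrag(urljoin(url, href))
            if urlparse(absolute).netloc == domain and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited


# Hypothetical in-memory site so the sketch runs without network access.
PAGES = {
    "https://example.com/": '<a href="/a">A</a> <a href="/b">B</a>',
    "https://example.com/a": '<a href="/b">B</a> <a href="https://other.test/">ext</a>',
    "https://example.com/b": '<a href="/">home</a> <a href="/a#top">A</a>',
}

visited = crawl("https://example.com/", PAGES.get)
print(visited)
```

To crawl a real site, `fetch` could be replaced by a small function that downloads the page (for instance with requests) and returns its text, or None on an error. Respecting robots.txt and rate limits is left out of this sketch.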
|
Prerequisite
|
Python programming
|
Flexible Teaching/Planning Schedules |
*This course follows the 16+2-week flexible teaching plan; the flexible arrangements for weeks 17 and 18 are as follows: |
|
Course Syllabus
Week | Unit | Instructional Method/Style/Teaching Style | References or Related Materials | Grading
1 | Web crawling strategies | Slides and programming | |
2 | Building a simple web crawler in Python from scratch | Slides and programming | |
3 | Web crawling with Scrapy | Slides and programming | |
4 | Web crawling with Scrapy (cont.) | Slides and programming | |
5 | Building an example Scrapy crawler for IMDb | Slides and programming | |
6 | Building an example Scrapy crawler for IMDb (cont.) | Slides and programming | |
7 | Building an example Scrapy crawler for IMDb (cont.) | Slides and programming | |
8 | Web crawling at scale | Slides and programming | |
9 | Midterm exam | Programming | |
10 | Extending Scrapy | Slides and programming | |
11 | Spiders | Slides and programming | |
12 | Spider arguments | Slides and programming | |
13 | Generic spiders | Slides and programming | |
14 | CrawlSpider | Slides and programming | |
15 | Crawling rules | Slides and programming | |
16 | CrawlSpider example | Slides and programming | |
17 | XMLFeedSpider | Slides and programming | |
18 | Final exam | Programming | |
The Degree to Which This Course Corresponds to School Competencies
No. | School Core Competencies | Degree of Conformity
1 | Citizen | 4
2 | Self-learning | 5
3 | Information | 5
4 | Creativity | 5
5 | Communication | 5
6 | Employability | 5
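The Degree to Which This Course Corresponds to Department Competencies
No. | Category | Department Core Competencies | Degree of Conformity
01 | Department | Ability to apply and put professional knowledge into practice | 0
02 | Department | Ability to identify problems, analyze experiments, and verify results | 0
03 | Department | Ability to think innovatively and develop new ideas | 0
04 | Department | Ability to coordinate and cooperate, lead teams, and manage and plan | 0
05 | Department | Ability to write and present in Chinese and English | 0
06 | Department | Ability to strengthen international perspective and engage in international exchange | 0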
The Degree to Which This Course Corresponds to College Competencies
No. | College Core Competencies | Degree of Conformity
1 | Language ability | 5
2 | Communication and teamwork | 5
3 | Innovation and practice | 5
4 | Professional knowledge and skills | 5
Textbooks or Reference Books
Library Books | Remarks
Python Web Scraping: Hands-on data scraping and crawling using PyQT, Selenium, HTML and Python, 2nd Revised Edition, ISBN-10 1786462583 |
※ Please respect intellectual property rights and do not illegally photocopy textbooks. ※
Teaching Method
Teaching Method | Percentage
Lecture | 60 %
Project-based practice | 30 %
Problem-based learning | 10 %
Total | 100 %
Grading
Grading | Percentage
Midterm exam | 30 %
Final exam | 40 %
Attendance | 10 %
Written assignments | 20 %
Total | 100 %
Supplementary Material of Course Syllabus
|
|
|