MIT143102016-V034000 - 雙語字幕

Now, let's think about harvesting data.
So two ways to harvest data.
First, you harvest data that's already there.
You go with your software.
One you need to find is like a combined tractor or whatever.
It's already there.
We don't need to plant it.
And then sometimes you actually need to plant it first.
That takes a little bit more time sometimes, but that's also useful.
So let me go over these two.
So what's grape scraping?
It could be full data from some data from some It could be calling an entire webpage, maybe from form to form.
It could be a set of forms running in the background,
so I'll give you an example where you keep a sort of querying,
for example, the of books, and any of the above in an ongoing fashion where you keep sending.
What does crawling mean?
Crawling means searching all over the website.
So in a page, you could just find one page and extract what you want from one page.
Or you could just keep going to the website and extract,
like, go through, for example, If you want to look for the, you want to construct the database of the price of
light, calling will involve going from, you know, keep querying.
This is, you know, Boston L.A.
And then Boston to Phoenix and Boston to San Diego.
So you're calling the site to find out what it is.
Like a color.
So to get data from the internet, that's not already in available form.
There are two big classes of methods to do that.
First of all, many of the big websites maintain what's called an API.
our application program interface, which basically help a particular program or a particular application to communicate with other applications.
So if you want data from Twitter,
or you want data from Facebook,
or you want data from Google Map,
you will not physically go on each of the website and then copy the information and put it in your database.
I mean, you could, and I'll show you a little bit how to do it later.
But it's not typically the way you would do it.
Typically something like Google Map,
I'll show you an example with Google Map,
but will have an API which is, they will directly give you the data that you need upon receiving a query from you.
Sometimes it is free,
sometimes it is, you have to pay, sometimes it is free for a little while, and then you have to pay for the rest.
Sometimes you first need authorization from the person, from the data you want, etc.
The Twitter,
a lot of the data is public anyway, because once you can follow someone and once you follow them, you can get their news.
So a lot of data from Twitter is perfectly fine.
And way you would do that is not by actually doing it.
scraping from the Twitter page the way you would do that by communicating with Twitter to an API and telling exactly what you need.
Say you want all of the tweet for
翻譯語言
選擇翻譯語言

解鎖更多功能

安裝 Trancy 擴展,可以解鎖更多功能,包括AI字幕、AI單詞釋義、AI語法分析、AI口語等

feature cover

兼容主流視頻平台

Trancy 不僅提供對 YouTube、Netflix、Udemy、Disney+、TED、edX、Kehan、Coursera 等平台的雙語字幕支持,還能實現對普通網頁的 AI 劃詞/劃句翻譯、全文沉浸翻譯等功能,真正的語言學習全能助手。

支持全平臺瀏覽器

Trancy 支持全平臺使用,包括iOS Safari瀏覽器擴展

多種觀影模式

支持劇場、閱讀、混合等多種觀影模式,全方位雙語體驗

多種練習模式

支持句子精聽、口語測評、選擇填空、默寫等多種練習方式

AI 視頻總結

使用 OpenAI 對視頻總結,快速視頻概要,掌握關鍵內容

AI 字幕

只需3-5分鐘,即可生成 YouTube AI 字幕,精準且快速

AI 單詞釋義

輕點字幕中的單詞,即可查詢釋義,並有AI釋義賦能

AI 語法分析

對句子進行語法分析,快速理解句子含義,掌握難點語法

更多網頁功能

Trancy 支持視頻雙語字幕同時,還可提供網頁的單詞翻譯和全文翻譯功能

開啟語言學習新旅程

立即試用 Trancy,親身體驗其獨特功能

立即下載