MIT143102016-V035500

So there are many data sets, and many websites that kind of specialize in data and make a lot of data available.
One that I really like is 538, the website that was started by Nate Silver.
What does 538 stand for?
Yeah, the number of electoral college votes.
Nate Silver, who had been a statistician specializing in baseball, moved to aggregating polls to estimate the probability of success in various elections, and has been quite successful at it.
In fact, of everyone I read, he was the only person who put a reasonably high probability on the 2016 election going the way it did.
So we had a lot of very interesting discussion on how to use data for politics.
But the website is much, much more than politics.
In particular, it has a lot of sports data, but it also has data about all sorts of other stuff.
For example, last year a student in this class did a paper on flight etiquette.
There was a survey of people on what they think is polite or impolite to do on a flight.
You'd be surprised by the number of people who think that it's impolite to bring a kid on a flight.
So that is what I remember from former students.
So 538 is where I would really go and visit first if you're interested in data.
There is lots of interesting discussion, and there are lots of data sets that would work for this class.
These data sets are usually not very big and kind of easy to get around, so it could be a good place to start and play with for some time.
Recently, so last year about this time, Yahoo released a data dump, which you can find there, and which at the time was the largest data set available, at one terabyte of data.
This is, you know, data on how people relate to the news.
I haven't really looked into it, but if you want to play with big data, that's big data right there that you can just download and use.
Very soon after, Uber, I think inspired by Yahoo, decided, why can't we do that too?
And they have started to make data available. For now, it's for city planners.
But in about a month, there will be an extract for the general public, and you can put your name on the waiting list on the website.
It might not be enough for this class, but it might help you.
It's called Uber Movement, and it's going to be based on a very, very large data set on people's driving movements, so there's probably going to be a lot to learn from that.
I think when we teach this, I this year from this one.
Over time, we'll have more of those, so getting used to seeing what this data looks like is useful.
Two other data sets.
Usually, a lot of people in this class like to work with sports data.
You're welcome to; you don't have to, but you can.
And if you do, there is a website that specializes in integrating lots of sports data.
Something which is very, very helpful for some projects is the Wayback Machine, which is basically an archive of the Internet.
Not all of it is free, but some of it is.
So, for example, you can find all of the front pages of the New York Times going back a long period of time.
So the Wayback Machine can be useful.
For example, I'll show you a project today where they were interested in how the price of used books has changed over time with the introduction of more and more online sellers of those books, etc.
So that's the paper.
So they used the Wayback Machine to display a bunch of web pages of searches for a particular book, and then they used the scraping technique that I'm going to show you in a minute to extract the data.
So the Wayback Machine is something that is very good to be aware of, especially in combination with web scraping methods.
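To make the idea of pulling an old version of a page a bit more concrete, here is a minimal sketch in Python, not the method used in the paper. It assumes the Internet Archive's public availability endpoint at `https://archive.org/wayback/available`, which takes a `url` and a `timestamp` and returns JSON describing the closest snapshot. The helper names (`availability_query`, `closest_snapshot`, `fetch_closest_snapshot`) and the sample response are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the Internet Archive's public "availability" endpoint.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(page_url, timestamp):
    """Build the query URL asking for the snapshot closest to `timestamp` (YYYYMMDD)."""
    return AVAILABILITY_ENDPOINT + "?" + urlencode(
        {"url": page_url, "timestamp": timestamp}
    )


def closest_snapshot(response_text):
    """Return the URL of the closest archived snapshot, or None if there is none."""
    payload = json.loads(response_text)
    closest = payload.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


def fetch_closest_snapshot(page_url, timestamp):
    """Query the endpoint over the network (not called when this file is imported)."""
    with urlopen(availability_query(page_url, timestamp), timeout=30) as resp:
        return closest_snapshot(resp.read().decode("utf-8"))


# Illustrative response only, written by hand to show the expected shape.
SAMPLE_RESPONSE = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20100101000000/http://nytimes.com/",
            "timestamp": "20100101000000",
            "status": "200",
        }
    }
})
```

Calling `closest_snapshot(SAMPLE_RESPONSE)` returns the snapshot URL from the hand-written sample, and `fetch_closest_snapshot("nytimes.com", "20100101")` would make the real request. The page you get back is plain HTML, so the scraping step would then run on it like on any other web page.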
And this is just a snippet.
There is much more.
You can search in library catalogs.
You can search on Google.
If you're interested in something, then at the moment, I guess, for many projects you'd find something that might be appropriate.