ӭ
ղرվ
 
 
ǰλ<<վҳ<<ϸϢ

100ôݴʻӢĶձ

ʱ䣺2020-09-30 1587


100ôݴʻӢĶձ

IT168 ôݴʻӢĶձ

A

ۺ(Aggregation) C ϲʾݵĹ

㷨(Algorithms) C ijݷѧʽ

(Analytics) C ڷݵں

쳣(Anomaly detection) C ݼԤģʽΪƥˡAnomalies,ʾ쳣Ĵ¼֣outliers, exceptions, surprises, contaminants.ͨṩؼĿִϢ

(Anonymization) C ʹƳ˽ص

Ӧ(Application) C ʵijضܵļ

˹(Artificial Intelligence) C зܻЩ豸ܹ֪ĻҪӦķӦѧϰ

B

Ϊ(Behavioural Analytics) C ַǸûΪ硰ôΪʲôôԼʲôóۣǽʱһŷѧƣеԻģʽ

ݿѧ(Big Data Scientist) C ܹƴ㷨ʹôݱõ

ݴҵ˾(Big data startup) C ָз´ݼ˹˾

ⶨ(Biometrics) C ݸ˵ʶ

Bֽ (BB: Brontobytes) C Լ1000 YB(Yottabytes)൱δֻĴС1 Bֽڰ270!

ҵ(Business Intelligence) C һϵۡѧ͹̣ʹݸױ

C

(Classification analysis) C лҪϢϵͳ; ҲΪԪ(meta data),ݵ

Ƽ(Cloud computing) C ϵķֲʽϵͳǴ洢ڻ(ƶ)

(Clustering analysis) C ǽƵĶۺһÿƵĶϳһ(Ҳ)Ĺַ̡ĿڷݼIJ

ݴ洢(Cold data storage) C ڵ͹ķϴ洢ЩʹõľݡЩݼܺʱ

Աȷ(Comparative analysis) C ڷdzݼнģʽƥʱһĶԱȺͼ̵õ

ӽṹ(Complex structured data) C Ӷ໥ɵݣݲܼ򵥵ɽṹѯԻ򹤾(SQL)

(Computer generated data) C ־ļɼɵ

(Concurrency) C ͬʱִжж

Է(Correlation analysis) C һݷڷ֮Ƿأ߸

ͻϵ(CRM: Customer Relationship Management) C ڹۡҵ̵һּݽӰ칫˾ĿͻϵIJ

D

DZ(Dashboard) C ʹ㷨ݣͼʽʾDZ

ݾۺϹ(Data aggregation tools) C ɢڶԴתһȫԴĹ

ݷʦ(Data analyst) C ݷģרҵԱ

ݿ(Database) C һijضļ洢ݼϵIJֿ

ݿ⼴(Database-as-a-Service) C ƶ˵ݿ⣬üѷƷ(AWS: Amazon Web Services)

ݿϵͳ(DBMS: Database Management System) C ռ洢ݣṩݵķ

(Data centre) C һʵص㣬洢ݵķ

ϴ(Data cleansing) C ݽУḶ́ĿɾظϢڵĴ󣬲ṩһ

ݹԱ(Data custodian) C άݴ洢輼רҵԱ

ݵ׼(Data ethical guidelines) C Щ׼֯ʹ͸֤ݵļࡢȫ˽

ݶ(Data feed) C һTwitterĺRSS

ݼ(Data marketplace) C ݼ߽׳

ھ(Data mining) C ݼзضģʽϢĹ

ݽģ(Data modelling) C ʹݽģݶԴ˶Ϥݵں

ݼ(Data set) C ݵļ

⻯(Data virtualization) C ϵḶ́Դ˻øϢͨݿ⣬Ӧóļϵͳҳݼȵ

ȥʶ(De-identification) C ҲΪ(anonymization)ȷ˲ͨݱʶ

б(Discriminant analysis) C ݷ;ͬķ෽ʽɽݷ䵽ͬȺ飬Ŀ¼һͳƷԶijЩȺȺ֪Ϣзлȡ

ֲʽļϵͳ(Distributed File System) C ṩ򻯵ģ߿õķʽ洢ݵϵͳ

ļݿ(Document Store Databases) C ֳΪĵݿ(document-oriented database), Ϊ洢ָĵݶרƵݿ⣬ĵҲΪṹ

E

̽Է(Exploratory analysis) C ûб׼̻򷽷´зģʽһַݺݼҪԵһַ

Eֽ(EB: Exabytes) C Լ1000 PB(petabytes), Լ1 GBȫÿϢԼΪ1 EB

ȡ-ת-(ETL: Extract, Transform and Load) C һݿݲֿĴ̡ӸֲͬԴȡ(E)ݣת(T)ҵҪݣ(L)ݿ

F

л(Failover) C ϵͳijʱԶؽлһ÷ڵ

ݴ(Fault-tolerant design) C һ֧ݴƵϵͳӦܹijһֳֹҲܼ

G

Ϸ(Gamification) C ϷϷ˼άͻƣַһʮѺõķʽݵĴ⣬dzЧ

ͼݿ(Graph Databases) C ͼνṹ(磬һ޵ԣijʵ)洢ݣͼδ洢ṹԵԺͽڵ㡣ṩڽڵܣҲ˵ݿÿԪؼ䶼Ԫֱӹ

(Grid computing) C ֲڲͬصļһԴijض⣬ͨͨƽһ

H

Hadoop C һԴķֲʽϵͳܣڿֲʽ򣬽дݵ洢

Hadoopݿ(HBase) C һԴġǹϵֲ͡ʽݿ⣬Hadoopܹͬʹ

HDFS C Hadoopֲʽļϵͳ(Hadoop Distributed File System);һƳʺͨӲ(commodity hardware)ϵķֲʽļϵͳ

ܼ(HPC: High-Performance-Computing) C ʹó临ӵļ

I

ڴݿ(IMDB: In-memory) C һݿϵͳͨݿϵͳ֮ͬڣ洢ݣӲ̡صܸٵؽݵĴʹȡ

(Internet of Things) C ͨ豸װϴʹЩ豸ܹκʱκεص

J

ϵһ(Juridical data compliance) C ʹõƼݴ洢ڲͬĹһͬĴ½ʱͻϹϵˡҪЩ洢ڲͬҵǷϵصķɡ

K

ֵݿ(KeyValue Databases) C ݵĴ洢ʽʹһضļָһضݼ¼ַʽʹݵIJҸӷݡֵݿͨΪл͵ݡ

L

ӳ(Latency) C ʾϵͳʱӳ

ϵͳ(Legacy system) C һ־ɵӦó򣬻ǾɵļǾɵļϵͳѾ֧ˡ

ؾ(Load balancing) C 䵽̨ԻϣԻŽϵͳʡ

λϢ(Location data) C GPSϢλϢ

־ļ(Log file) C ɼϵͳԶɵļ¼ϵͳй̡

M

M2M(Machine2Machine data) C ̨̨佻봫

(Machine data) C ɴ㷨ڻϲ

ѧϰ(Machine learning) C ˹ܵһָ֣ǻܹɵнѧϰͨڵۻʵҸĽ

MapReduce C Ǵģݵһ(Map: ӳ䣬Reduce: )

ģд(MPP: Massively Parallel Processing) C ͬʱʹö(̨)ͬһ

Ԫ(Metadata) C Ϊݵݣ(ʲô)Ϣ

MongoDB C һֿԴķǹϵݿ(NoSQL database)

άݿ(Multi-Dimensional Databases) C Ż(OLAP)Żݲֿһݿ⡣

ֵݿ(MultiValue Databases) C һַǹϵݿ(NoSQL), һĶάݿ⣺ܴ3άȵݡҪԷdzַܹشHTMLXMLеִ

N

ȻԴ(Natural Language Processing) C Ǽѧһ֧оʵּ֮Ľ

(Network analysis) C ͼнڵĹϵнڵӺǿȹϵ

NewSQL C һŵġõݿϵͳSQLѧϰʹãNoSQLݿ

NoSQL C ˼壬ǡʹSQLݿ⡣ݿָⷺͳϵݿ͵ݿ⡣ݿиǿһԣܴģ͸߲ݡ

O

ݿ(Object Databases) C (ҲΪݿ)Զʽ洢ݣ̡ͬڹϵݿͼݿ⣬󲿷ֶݿⶼṩһֲѯԣʹʽ(declarative programming)ʶ.

ڶͼ(Object-based Image Analysis) C ͼǶÿһصݽзڶͼֻصݣЩرΪͼ

ݿ(Operational Databases) C ݿһ֯ijҵӪdzҪһʹû ռ˾ڲľϢ

Ż(Optimization analysis) C ڲƷ㷨ʵֵẒ̇һУ˾ƸָIJƷЩƷǷԤֵ

(Ontology) C ʾ֪ʶ壬ڶһеĸ֮Ĺϵһѧ˼롣(ע: ݱߵѧĸ߶ȣ籾壬ΪһĿ͹)

쳣ֵ(Outlier detection) C 쳣ֵָƫһݼһƽֵĶ󣬸öݼеȥԶˣ쳣ֵijζϵͳ⣬ҪԴӷ

P

ģʽʶ(Pattern Recognition) C ͨ㷨ʶеģʽͬһԴеԤ

Pֽ(PB: Petabytes) C Լ1000 TB(terabytes), Լ1 GB (gigabytes)ŷ޺о(CERN)ǿӶײÿӸԼΪ1 PB

ƽ̨(PaaS: Platform-as-a-Service) C ΪƼṩбĻƽ̨һַ

Ԥ(Predictive analysis) C ݷмֵһַַԤδ()Ϊij˺ܻܿijЩƷܻijЩվijЩ߲ijΪͨʹøֲͬݼʷݣݣ罻ݣ߿ͻĸϢݣʶպͻ

˽(Privacy) C ѾпʶϢݷ뿪ȷû˽

(Public data) C ɹ𴴽ĹϢ򹫹ݼ

Q

ֻ(Quantified Self) C ʹӦóûһһһӶõصΪ

ѯ(Query) C ij𰸵Ϣ

R

ʶ(Re-identification) C ݼϲһ𣬴ʶϢ

ع(Regression analysis) C ȷϵַ֮ڵϵ(עԱ߲ɻ)

RFID C Ƶʶ; ʶʹһ߷ǽӴʽƵų

ʵʱ(Real-time data) C ָڼڱ洢ʾ

Ƽ(Recommendation engine) C Ƽ㷨û֮ǰĹΪΪûƼijֲƷ

·(Routing analysis) C ij䷽ͨʹöֲͬıӶҵһ·ԴﵽȼϷãЧʵĿ

S

ṹ(Semi-structured data) C ṹݲнṹϸĴ洢ṹʹñǩʽıǷʽԱ֤ݵIJνṹ

з(Sentiment Analysis) C ͨ㷨οijЩ

źŷ(Signal analysis) C ָͨʱռ仯Ʒܡرʹôݡ

(Similarity searches) C ݿвѯƵĶ˵ݶ͵

(Simulation analysis) C ָģʵн̻ϵͳIJڷʱǶֲͬıȷƷܴﵽ

(Smart grid) C ָԴʹôʵʱ״̬Ч

(SaaS: Software-as-a-Service) C WebͨʹõһӦ

ռ(Spatial analysis) C ռϢϢռݣеóֲڵռеݵģʽ͹

SQL C ڹϵݿУڼݵһֱ

ṹ(Structured data) -֯нṹʶݡͨһ¼һļDZȷǹеijһֶΣҿԱȷضλ

T

Tֽ(TB: Terabytes) C Լ1000 GB(gigabytes)1 TBԴ洢Լ300СʱĸƵ

ʱ(Time series analysis) C ظʱõĶõݡݱöģҪȡͬʱʱ㡣

ݷ(Topological Data Analysis) C ݷҪע㣺ģ͡ȺʶԼݵͳѧ塣

(Transactional data) C ʱ仯Ķ̬

͸(Transparency) C Ҫ֪ǵʲôáδ֯ЩϢ͸ˡ

U

ǽṹ(Un-structured data) C ǽṹһ㱻ΪǴıݣлܰڣֺʵ

V

ֵ(Value) C (ע4Vص֮һ) пõݣΪ֯ᡢߴ޴ļֵζŸҵҵӴл档

ɱ(Variability) C Ҳ˵ݵĺ()仯ġ磬һͬпȫͬ˼

(Variety) C (ע4Vص֮һ) Ըֲͬʽ֣ṹݣṹݣǽṹݣиӽṹ

(Velocity) C (ע4Vص֮һ) ڴʱݵĴ洢⻯Ҫ󱻸ٴ

ʵ(Veracity) C ֯Ҫȷݵʵԣܱ֤ݷȷԡˣʵ(Veracity)ָݵȷԡ

ӻ(Visualization) C ֻȷĿӻԭʼݲſɱͶʹáġӻͨͼͻͼӻָǵĸӵͼͼаϢԱ׵Ķ

(Volume) C (ע4Vص֮һ) ָΧMegabytesBrontobytes

W

(Weather data) C һҪĿŹԴԴϳһ𣬿Ϊ֯ṩ

X

XMLݿ(XML Databases) C XMLݿһXMLʽ洢ݵݿ⡣XMLݿͨĵݿԱԶXMLݿݽвѯԼָĸʽл

Y

Yֽ (Yottabytes) C Լ1000 ZB (Zettabytes), Լ250DVDֻֽΪ1 YB, ҽÿ18귭һ

Z

Zֽ (ZB: Zettabytes) C Լ1000 EB (Exabytes), Լ1 TBԤ⣬2016ȫΧÿͨϢԼܴﵽ1 ZB

洢λ

1 Bit() = Binary Digit

8 Bits = 1 Byte(ֽ)

1,000 Bytes = 1 Kilobyte

1,000 Kilobytes = 1 Megabyte

1,000 Megabytes = 1 Gigabyte

1,000 Gigabytes = 1 Terabyte

1,000 Terabytes = 1 Petabyte

1,000 Petabytes = 1 Exabyte

1,000 Exabytes = 1 Zettabyte

1,000 Zettabytes = 1 Yottabyte

1,000 Yottabytes = 1 Brontobyte

1,000 Brontobytes = 1 Geopbyte

0
ͳ
˼
ţICP19031614-1