Data Science at IU Internationale Hochschule | Flashcards & Summaries

Learning materials for Data Science at IU Internationale Hochschule

Access free flashcards, summaries, practice exercises, and past exams for your Data Science course at IU Internationale Hochschule.

• 719334 flashcards
• 15031 students
• 414 learning materials

Sample flashcards for your Data Science course at IU Internationale Hochschule, created by fellow students on StudySmarter!

Q:

How are the true positive rate and the true negative rate calculated?

A:

TP rate = sensitivity = 1 - FNR = 1 - (Type II errors / P) = TP / P, where P = TP + FN is the number of actual positives

TN rate = specificity = 1 - FPR = 1 - (Type I errors / N) = TN / N, where N = TN + FP is the number of actual negatives

Type I error: false positive (FP)

Type II error: false negative (FN)
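
The two formulas can be checked on a small example. The sketch below assumes the four confusion-matrix counts are already known; the function name `rates` and the counts are made up for illustration and do not come from the course material.

```python
def rates(tp, fn, tn, fp):
    """Return (TP rate, TN rate) from confusion-matrix counts (hypothetical helper)."""
    p = tp + fn  # all actual positives
    n = tn + fp  # all actual negatives
    tpr = tp / p  # sensitivity = 1 - FN / P
    tnr = tn / n  # specificity = 1 - FP / N
    return tpr, tnr


# Illustrative counts: 100 actual positives, 100 actual negatives.
tpr, tnr = rates(tp=80, fn=20, tn=90, fp=10)
print(f"TP rate = {tpr:.2f}, TN rate = {tnr:.2f}")
```

With these counts, 20 Type II errors out of 100 positives give a TP rate of 0.80, and 10 Type I errors out of 100 negatives give a TN rate of 0.90.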

Q:

Which activities belong to the Data Flow dimension?

A:
• data collection
• data storage
• data accessing
Q:

Which activities belong to the Data Curation dimension?

A:
• data cleaning
• data presentation
• data evaluation
Q:

Which activities belong to the Data Analytics dimension?

A:
• statistical analysis
• modeling & simulations
• visual techniques
Q:
What are the two data types?
A:
quantitative: the data are measurable (numeric) values

qualitative: the data describe the quality of a good or a service

Q:

Describe the subtypes of qualitative and quantitative data

A:
• categorical nominal (qualitative): categories with no inherent order
• categorical ordinal (qualitative): categories with an inherent order
• categorical binary (qualitative): exactly 2 categories
• discrete (quantitative): the attribute takes countable values, typically integers
• continuous (quantitative): the attribute takes any value within a range
Q:

What are the five approaches to transform a dataset?

A:
• logarithm transformation
• power law transformation
• reciprocal transformation
• discrete Fourier transform
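
A minimal sketch of the listed transformations, using only the Python standard library. The function names, the exponent of 2, and the sample values are assumptions made for illustration; in practice a library routine such as an FFT would normally replace the naive DFT shown here.

```python
import cmath
import math


def log_transform(xs):
    """Logarithm transformation (requires strictly positive values)."""
    return [math.log(x) for x in xs]


def power_transform(xs, exponent=2.0):
    """Power law transformation; the exponent is an illustrative choice."""
    return [x ** exponent for x in xs]


def reciprocal_transform(xs):
    """Reciprocal transformation (requires non-zero values)."""
    return [1.0 / x for x in xs]


def dft(xs):
    """Naive discrete Fourier transform, O(n^2), for illustration only."""
    n = len(xs)
    return [
        sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(xs))
        for k in range(n)
    ]


data = [1.0, 2.0, 4.0, 8.0]  # made-up sample values
print(log_transform(data))
print(power_transform(data))
print(reciprocal_transform(data))
print(dft(data))
```

The first DFT coefficient is the sum of the input values, which is a quick sanity check on the implementation.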
Q:

How are the target variable and the output of the regression model related to each other?

A:

y = y' + epsilon, where y is the target variable and y' is the output of the regression model

The predicted output differs slightly from the target because the relationship between the independent variables and the target variable is not exactly linear. The error term epsilon accounts for this difference.

Q:

Compare Feature Engineering and Deep Learning

A:

In feature engineering (FE) we have to define the features ourselves; deep learning (DL) finds the features on its own.

In FE, domain knowledge is needed to choose relevant features.

Q:

Compare batch and stream processing.

A:

Batch = data are collected and transmitted as a block; for example, data from a retail store

Stream = data are provided immediately as they are generated; for example, sensor data in industry
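
The difference can be sketched with a running mean. This is only an illustration under simple assumptions: the function names and the sensor readings are hypothetical, and a real pipeline would read from a file, queue, or socket rather than a list.

```python
from typing import Iterable, Iterator, List


def process_batch(records: List[float]) -> float:
    """Batch: the whole block is available before processing starts."""
    return sum(records) / len(records)


def process_stream(records: Iterable[float]) -> Iterator[float]:
    """Stream: update the result as soon as each record arrives."""
    count = 0
    total = 0.0
    for value in records:
        count += 1
        total += value
        yield total / count  # running mean after this record


readings = [20.0, 22.0, 24.0]  # made-up sensor readings

batch_mean = process_batch(readings)
running_means = list(process_stream(iter(readings)))

print(batch_mean)
print(running_means)
```

The batch function yields one result after all data have arrived, while the stream function yields an updated result per record; the last running mean equals the batch mean.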

Q:

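What is AR used for? And what is ARIMA used for?

A:

AR = for time series in which the current value depends linearly on its own past values

ARIMA = for non-stationary data, for example with a trend; adds moving-average (MA) and integrated (differencing) terms to AR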
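
A minimal sketch of two of the building blocks, assuming a first-order AR model without intercept, x_t = phi * x_(t-1) + noise, and first-order differencing as the integrated step. The function names and both series are made up for illustration; a full ARIMA fit would normally be left to a statistics library.

```python
def difference(series):
    """First-order differencing: the 'integrated' (I) step of ARIMA."""
    return [cur - prev for prev, cur in zip(series, series[1:])]


def fit_ar1(series):
    """Estimate phi in x_t = phi * x_(t-1) + noise by least squares (no intercept)."""
    num = sum(prev * cur for prev, cur in zip(series, series[1:]))
    den = sum(prev * prev for prev in series[:-1])
    return num / den


# Noise-free AR(1) series generated with phi = 0.5 (illustrative values).
ar_series = [8.0, 4.0, 2.0, 1.0, 0.5]
phi = fit_ar1(ar_series)
forecast = phi * ar_series[-1]  # one-step-ahead forecast

# A series with a linear trend becomes constant after differencing.
trend_series = [3, 5, 7, 9, 11]
diffs = difference(trend_series)

print(phi, forecast)
print(diffs)
```

Differencing removes the trend, so an AR or MA model can then be fitted to the differenced series.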

Q:

What is the "least-squares method"?

A:

Find the values of w0 and w1 that minimize the sum of the squared errors between the targets and the model outputs.
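
A minimal sketch, assuming the model is a straight line y' = w0 + w1 * x (the card names only w0 and w1, so the single-feature form is an assumption). It uses the standard closed-form solution; the function names and data points are hypothetical.

```python
def least_squares_fit(xs, ys):
    """Closed-form least-squares estimates for y' = w0 + w1 * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    w1 = sxy / sxx              # slope
    w0 = mean_y - w1 * mean_x   # intercept
    return w0, w1


def sum_squared_error(xs, ys, w0, w1):
    """The quantity that least squares minimizes."""
    return sum((y - (w0 + w1 * x)) ** 2 for x, y in zip(xs, ys))


# Made-up points lying exactly on y = 1 + 2x.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

w0, w1 = least_squares_fit(xs, ys)
sse = sum_squared_error(xs, ys, w0, w1)
print(w0, w1, sse)
```

Because the sample points lie exactly on a line, the fit recovers w0 = 1 and w1 = 2 with zero squared error; any other choice of w0 and w1 gives a larger sum.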
