Explore our extensive collection of questions and answers to enhance your learning experience and prepare for exams effectively
The learning rate determines the size of steps taken during gradient descent optimization.
Computer vision aims to allow machines to extract meaningful data from images or videos mimicking human vision.
Clustering groups similar words or phrases based on semantic or syntactic similarity.
The attention mechanism allows models to focus on important parts of the input sequence, improving performance.
The Activation Layer, using functions like ReLU, introduces non-linearity to the model.
Dropout randomly deactivates a subset of neurons during training to prevent overfitting.
Word Embedding converts words into numerical vectors that capture semantic meaning.
TF-IDF (Term Frequency-Inverse Document Frequency) measures a word’s importance based on its frequency and rarity across documents.
Stemming reduces words to their root or base form by removing suffixes.
Bag of Words represents text as a vector of word frequencies, ignoring grammar and order.