View syllabus
MULTIMEDIA INFORMATION RETRIEVAL AND COMPUTER VISION
GIUSEPPE AMATO
Academic year2020/21
CourseARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Code886II
Credits9
PeriodSemester 1
LanguageEnglish

ModulesAreaTypeHoursTeacher(s)
MULTIMEDIA INFORMATION RETRIEVAL AND COMPUTER VISIONING-INF/05LEZIONI90
GIUSEPPE AMATO unimap
FABRIZIO FALCHI unimap
CLAUDIO GENNARO unimap
Obiettivi di apprendimento
Learning outcomes
Conoscenze

Fornire agli studenti un solido background su Multimedia Information Retrieval, Multimedia Content Based Retrieval, Automated Multimedia Content Understanding, Multimedia Data Mining, Computer Vision.

Comprendere le sfide e le problematiche legate all'efficacia, all'efficienza e alla scalabilità quando si ha a che fare con set di dati multimediali molto grandi (su scala web).

Knowledge

Providing students with solid background on Multimedia Information Retrieval, Multimedia Content Based Retrieval, Automated Multimedia Content Understanding, Multimedia Data Mining, Computer Vision.

Understanding the challenges and issues related to effectiveness, efficiency, and scalability when dealing with very large (web scale) multimedia data sets. 

Modalità di verifica delle conoscenze

La valutazione delle conoscenze sarà effettuata attraverso il progetto finale (opzionale) e la prova orale.

Assessment criteria of knowledge

Assessment of knowledge will be through final project (optional) and oral test.

Capacità

Gli studenti impareranno a ragionare intorno ai problemi e le soluzioni per Text-Based Retrieval, Multimedia Content Analysis, Multimedia Content Description, Computer Vision, Multimedia, Content Based Indexing, Multimedia Content Based Retrieval, Multimedia Data Mining and Classification, Similarity Searching, Scalable Access Methods for Similarity Searching, Deep Learning technqiues applied to multimedia

Skills

Students will learn issues and solutions to Text-Based Retrieval, Multimedia Content Analysis, Multimedia Content Description, Computer Vision, Multimedia, Content Based Indexing, Multimedia Content Based Retrieval, Multimedia Data Mining and Classification, Similarity Searching, Scalable Access Methods for Similarity Searching, Deep Learning technqiues applied to multimedia

Modalità di verifica delle capacità

Durante le sessioni di laboratorio, agli studenti verranno proposti compiti mirati da risolvere utilizzando strumenti appresi nelle sessioni di teoria. 

Assessment criteria of skills

During the computer lab sessions, students will be proposed focussed tasks to solve using tools and writing code related to the theory sessions. 

Comportamenti

Gli studenti impareranno metodologie di valutazione rigorose per valutare le prestazioni delle soluzioni per Multimedia Information Retrieval

Behaviors

Students will learn rigorous evaluation methodologies to asses performance of multimedia information retrieval solutions

Modalità di verifica dei comportamenti

I progetti finali richiederanno la valutazione delle prestazioni delle soluzioni sviluppate utilizzando  misure rigorose di Multimedia Information Retrieval

Assessment criteria of behaviors

Final projects will require evaluatinng the performance of the developed solutions using rigorous multimedia information retrieval measures.

Prerequisiti (conoscenze iniziali)

Nozioni di base sui sistemi di gestione dei database, programmazione Python

Prerequisites

Basics of Database management systems, Python programming

Teaching methods

Lectures, with visual aids such as powerpoints/videos and computer lab sessions

Programma (contenuti dell'insegnamento)

Text Indexing and Retrieval:

  • Introduction to Information Retrieval. Information Retrieval Systems vs Database Systems. Boolean retrieval. Term-document incidence matrices. Inverted indexes. Indexing Inverted indexes. Merge Algorithms: AND, and, BUT. Issues about the boolean retrieval and introduction to the Ranked Retrieval. Scoring documents. Term frequency. Collection statistics Weighting schemes. Vector space scoring.
  • Term Frequency and Inverse Document Frequency. Vector space representation. Cosine similarity, Computing cosine scores, Normalizations, Maximum tf normalization. Evaluation in information retrieval,  Evaluation in information retrieval: Precision, Recall, and Accuracy. F1 metric. 
  • Mean Average Precision. Text Classification. Evaluation of Text Classifiers. Confusion matrix. Binary Classification. Multiclass Classification. f-score. Macro- vs Micro- Averaging. Supervised Learning. k nearest neighbor classification. Query processing and phrase queries. 
  • Content-based and Text-based image retrieval. Global and Local features for Content-based image retrieval (CBIR). Metric Spaces. Triangle inequality and indexing. CBIR through permutation-based approach. Complemented permutations. Content-based image retrieval through standard text retrieval techniques.

Audio Indexing and Retrieval

  • Audio retrieval and Fingerprinting. Notes on Fourier Transform, Discrete Fourier Transform, and Short Time Fourier Transform. Spectral Peak as audio fingerprint. Indexing Fingerprint with inverted lists.

Multimedia Content Indexing and Representation

  • Multimedia Content Representation: color spaces, still images, video
  • Global Features, Local Invariant Feature Detectors, Local Features
  • Local Features Matching, Local Features Aggregation
  • Deep Learning in Computer Vision, Convolutional Neural Networks
  • Deep Learning Advanced Topics, Transfer Learning
  • Detection and Segmentation: fully convolutional neural network, R-CNN, Fast R-CNN, Faster R-CNN, Mask R-CNN
  • Generative Models: PixelRNN, PixelCNN, Variational Autoencoder, Generative Adversarial Network

Multimedia Searching on a Large Scale:

  • Foundation of Metric Space Searching, Distance Searching Problem, Metric Distance Measures, Similarity Queries, Basic Partitioning Principles, Principles of Similarity Query Execution, Policies for avoiding Distance Computations, Priciples of Approximate Similarity Search
  • Advanced issues: Statistics, Proximity, Performance Prediction, Tree quality Measures, Choosing reference points
  • Exact Similarity Search: Vantage Point Trees, AESA/LAESA, The M-Tree Family, The M-Tree, Bulk-Loading Algorithm, Multy-Way Insertion Algorithms, The Slim Tree, Slim-Down Algorithm, Pivoting M-Tree
  • Hash Based Methods: D-Index. Approximate Similarity Search with M-Tree,
  • Approximate Similarity Search: Relative Error Approximation, Good Fraction Approximation, Small Chance Improvement Approximation, Proximity-Based Approximation, PAC Nearest Neighbor Searching
  • Performance tests
  • Recent Similarity Search Techniques: Permutation Based Methods, Permutation Spearman Rho, PP-Index, MI-File, Locality Sensitive Hashing (LSH), LSH based on p-stable distributions, LSH with Hamming Distance 
Syllabus

Text Indexing and Retrieval:

  • Introduction to Information Retrieval. Information Retrieval Systems vs Database Systems. Boolean retrieval. Term-document incidence matrices. Inverted indexes. Indexing Inverted indexes. Merge Algorithms: AND, and, BUT. Issues about the boolean retrieval and introduction to the Ranked Retrieval. Scoring documents. Term frequency. Collection statistics Weighting schemes. Vector space scoring.
  • Term Frequency and Inverse Document Frequency. Vector space representation. Cosine similarity, Computing cosine scores, Normalizations, Maximum tf normalization. Evaluation in information retrieval,  Evaluation in information retrieval: Precision, Recall, and Accuracy. F1 metric. 
  • Mean Average Precision. Text Classification. Evaluation of Text Classifiers. Confusion matrix. Binary Classification. Multiclass Classification. f-score. Macro- vs Micro- Averaging. Supervised Learning. k nearest neighbor classification.Query processing and phrase queries. 
  • Content-based and Text-based image retrieval. Global and Local features for Content-based image retrieval (CBIR). Metric Spaces. Triangle inequality and indexing. CBIR through permutation-based approach. Complemented permutations. Content-based image retrieval through standard text retrieval techniques.

Audio Indexing and Retrieval

  • Audio retrieval and Fingerprinting. Notes on Fourier Transform, Discrete Fourier Transform, and Short Time Fourier Transform. Spectral Peak as audio fingerprint. Indexing Fingerprint with inverted lists.

Multimedia Content Indexing and Representation

  • Multimedia Content Representation: color spaces, still images, video
  • Global Features, Local Invariant Feature Detectors, Local Features
  • Local Features Matching, Local Features Aggregation
  • Deep Learning in Computer Vision, Convolutional Neural Networks
  • Deep Learning Advanced Topics, Transfer Learning
  • Detection and Segmentation: fully convolutional neural network, R-CNN, Fast R-CNN, Faster R-CNN, Mask R-CNN
  • Generative Models: PixelRNN, PixelCNN, Variational Autoencoder, Generative Adversarial Network

Multimedia Searching on a Large Scale:

  • Foundation of Metric Space Searching, Distance Searching Problem, Metric Distance Measures, Similarity Queries, Basic Partitioning Principles, Principles of Similarity Query Execution, Policies for avoiding Distance Computations, Priciples of Approximate Similarity Search
  • Advanced issues: Statistics, Proximity, Performance Prediction, Tree quality Measures, Choosing reference points
  • Exact Similarity Search: Vantage Point Trees, AESA/LAESA, The M-Tree Family, The M-Tree, Bulk-Loading Algorithm, Multy-Way Insertion Algorithms, The Slim Tree, Slim-Down Algorithm, Pivoting M-Tree
  • Hash Based Methods: D-Index. Approximate Similarity Search with M-Tree,
  • Approximate Similarity Search: Relative Error Approximation, Good Fraction Approximation, Small Chance Improvement Approximation, Proximity-Based Approximation, PAC Nearest Neighbor Searching
  • Performance tests
  • Recent Similarity Search Techniques: Permutation Based Methods, Permutation Spearman Rho, PP-Index, MI-File, Locality Sensitive Hashing (LSH), LSH based on p-stable distributions, LSH with Hamming Distance 
Bibliografia e materiale didattico

Si consulti il sito del corso: https://sites.google.com/site/unipimircv/

Bibliography

See Course Web Site: https://sites.google.com/site/unipimircv/

Non-attending students info

-

Modalità d'esame

L'esame finale è costituito da un progetto pratico opzionale e da una prova orale.

Il progetto utilizzerà le nozioni apprese durante le sessioni teoriche e la sessione di laboratorio per sviluppare uno strumento software completamente funzionale per gestire uno scenario specifico relativo alla gestione e al recupero delle informazioni multimediali. Il progetto sarà assegnato a gruppi di studenti (tipicamente da 1 a 5).

Il test orale ha lo scopo di verificare che gli studenti abbiano acquisito le conoscenze e le competenze richieste.

Assessment methods

Final exam is made of an optional practical project and an oral test.

The project will use the notions learnt during the theory sessions and the computer lab session to develop a fully functional software tool to handle a specific scenario related to Multimedia Information Management and Retrieval. The project will be assigned to groups of students (typically from 1 to 5).

The oral test aims at checking the students acquired the required knowledge and skills 

Updated: 24/09/2020 14:42