CV

Continuous Learning - 'We cannot become what we want by remaining what we are', Max DePee.

Basics

Name Valentinos Pariza
Label PhD Student @ UTN
Email valentinos.pariza@utn.de
Url https://vpariza.github.io
Summary AI enthusiast and PhD researcher specializing in Self-Supervised Learning for vision, driven by a passion to uncover the strengths and limitations of Deep Learning models and technologies. Guided by curiosity and fueled by innovation, I strive to explore, learn, and contribute to shaping the future of AI with every discovery.

Work

  • 2024.02 - 2024.07
    Research Intern
    TNO - Netherlands Organisation for Applied Scientific Research
    Worked on improving In-Context Learning in Computer Vision with Patch Nearest Neighbor Consistency as part of my Master's Thesis.
    • Collaborated with Dr. Yuki M. Asano and Dr. Gertjan Burghouts.
    • Advanced understanding of in-context learning for vision tasks.
  • 2023.07 - 2023.08
    Data Science Intern
    South Pole
    Enhanced product-to-category matching algorithms using machine learning techniques.
    • Diagnosed issues in the matching of products to CO2 footprint algorithms through log analysis.
    • Created a ground truth dataset for evaluating and training a better matching of Products to Categories via web scraping.
    • Improved top-1 system's Product to Category matching accuracy by 25.2% using a Text Transformer Encoder.
    • Deployed models to production with Kubernetes.
  • 2022.06 - 2022.07
    Research Intern
    CYENS Centre of Excellence
    Developed a Python library to accelerate Quality Diversity Optimization algorithms.
    • Enhanced efficiency of state-of-the-art QD algorithms by over 2x using JAX.
  • 2020.08 - 2021.07
    Software Development Engineer Intern
    Amazon Data Services Ireland
    Developed tools to optimize internal data processes and improve code logging practices.
    • Automated report generation, saving hours of manual effort.
    • Improved a Java library for accessing internal AWS data.
    • Promoted good code logging practices, saving thousands of dollars annually.
    • Optimized a large-scale real-time log analysis service.
  • 2020.06 - 2020.08
    Research Intern
    CYENS Centre of Excellence
    Built a Speech Emotion Recognition system for dyadic conversations.
    • Engineered an end-to-end system using state-of-the-art methodologies.
  • 2019.10 - 2022.03
    Co-Founder & Backend Developer
    Fooderloo
    Co-founded a business addressing food waste in Cyprus, developing its backend systems.
    • Designed and implemented backend, database, and server-side systems.
  • 2019.06 - 2019.07
    Research Intern
    Department of Computer Science, University of Cyprus
    Developed algorithms to improve micro-network performance and fault tolerance.
    • Designed fault-tolerant message routing and faulty node detection algorithms.
    • Developed a Java-based simulation program for testing routing algorithms.
  • 2016.07 - 2017.09
    Signal Soldier
    National Guard of Cyprus
    Served as a Network and Computer Technician, operating intra-office communications.

Education

  • 2024.12 - Present

    Nuremberg, Germany

    PhD
    University of Technology Nuremberg (UTN)
    Deep Learning for Vision
  • 2022.09 - 2024.08

    Amsterdam, Netherlands

    MSc
    University of Amsterdam (UvA)
    Artificial Intelligence (AI)
    • Deep Learning 1 & 2
    • Computer Vision 1 & 2
    • Natural Language Processing
    • Machine Learning
    • Game Theory
    • Information Retrieval
    • Fairness, Accountability, Confidentiality and Transparency in AI
    • Knowledge Representation and Reasoning
  • 2017.09 - 2022.06

    Nicosia, Cyprus

    BSc
    University of Cyprus (UCY)
    Computer Science (CS), with a focus in AI
    • Data Structures & Algorithms
    • Object Oriented Programming (Java)
    • Algorithms
    • Computer Networks
    • Operating Systems
    • Databases
    • Software Engineering
    • Artificial Intelligence
    • Machine Learning
    • Computer Vision
    • Data Mining on the Web
    • Calculus I & II
    • Linear Algebra
    • Probabilities & Statistics
    • Theory of Computation
    • Introduction to Economics
    • Logic Programming & Artificial Intelligence
    • Parallel Processing
    • System Security
    • Synthesis of Parallel Algorithms
    • Programming & Tools
    • Logic in Computer Science

Awards

Certificates

Using Python to Access Web Data by University of Michigan
Coursera & Michigan University 2020-08-01
Python Data Structures by University of Michigan
Coursera & Michigan University 2020-06-01
Machine Learning by Stanford University
Coursera & Stanford University 2019-08-01
Mathematics for Machine Learning: Linear Algebra by Imperial College London
Coursera & Imperial College of London 2019-07-01

Publications

  • 2025.01.22
    NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency
    ICLR 2025
    The paper proposes a new self-supervised learning method, NeCo (Patch Neighbor Consistency), which enhances pretrained representations by enforcing patch-level nearest neighbor consistency between a student and teacher model. This is done using a differentiable sorting method on top of pretrained models like DINOv2. The method achieves significant performance improvements with minimal computational cost (19 hours on a single GPU). It outperforms previous methods, setting new state-of-the-art results for semantic segmentation on several datasets, including ADE20k, Pascal VOC, and COCO-Things and COCO-Stuff.
  • 2023.07.20
    [Re] Reproducibility Study of 'Label-Free Explainability for Unsupervised Models'
    ReScience C
    This work evaluates the reproducibility of the paper 'Label-Free Explainability for Unsupervised Models' by Crabbe and van der Schaar. The goal is to reproduce the paper's four main claims in a label-free setting: (1) feature importance scores reveal key features of a model's input, (2) example importance scores identify key training examples to explain a test example, (3) saliency map interpretability is difficult for disentangled VAEs, and (4) different pretext tasks have non-interchangeable representations.

Skills

Python
Object Oriented Programming
Data Structures
Algorithms
Web Development
Machine Learning
Deep Learning
Computer Vision
Natural Language Processing
Data Mining
Data Analysis
Data Visualization
Web Scraping
APIs
Databases
TensorFlow
Keras
TensorBoard
TensorFlow Lite
TensorFlow.js
TensorFlow Serving
TensorFlow Hub
TensorFlow Extended
TensorFlow Quantum
TensorFlow Graphics
TensorFlow Probability
TensorFlow Agents
TensorFlow Addons
TensorFlow Recommenders
TensorFlow I/O
TensorFlow Privacy
TensorFlow Federated
TensorFlow Model Optimization
PyTorch
Pytorch Lightning
Pytorch TOrchvision
Pytorch Datasets
Pytorch Audio
Pytorch Text
Pytorch Geometric
Pytorch NN Module
Databases
SQL
NoSQL
MongoDB
PostgreSQL
SQLite
DynamoDB
Data Structures & Algorithms
Arrays
Linked Lists
Stacks
Queues
Trees
Graphs
Heaps
Hash Tables
Sorting
Searching
Dynamic Programming
Greedy Algorithms
Backtracking
Divide and Conquer
Bit Manipulation
Recursion
Web Development
HTML
CSS
JavaScript
TypeScript
Bootstrap
jQuery
React
Angular
Node.js
Django
Flask
FastAPI
Ruby on Rails
ASP.NET
JSP
PHP
Java
Object Oriented Programming
Data Structures
Algorithms
Web Development
Machine Learning
Natural Language Processing
Data Mining
Data Analysis
Data Visualization
Web Scraping
APIs
Databases
C/C++
Object Oriented Programming
Data Structures
Algorithms
Machine Learning
Natural Language Processing
Deep Learning
Databases

Languages

Greek
Native speaker
English
Fluent
Russian
Intermediate

Interests

Artificial Intelligence, Machine & Deep Learning
Deep Learning
Vision Models
Machine Learning Algorithms
Neural Networks
Model Optimization
Transfer Learning
Representation Learning
Self-Supervised Learning (SSL)
Synthetic Data
Data Augmentation
Generative Models
GANs (Generative Adversarial Networks)
Diffusion Models
Synthetic Dataset Creation
Domain Randomization
Data Efficiency
AI Training Scalability
Curiosity-Driven Exploration
Research and Development (R&D)
Novel Discoveries
Exploratory Data Analysis (EDA)
Experimental Design
Intellectual Curiosity
Open-Ended Problem Solving
Teaching and Mentoring
Knowledge Sharing
Technical Training
Curriculum Development
Educational Workshops
Mentorship Programs
Peer Collaboration
Innovation and Problem Solving
Cutting-Edge Technology
Creative Solutions
Applied Research
Product Development
Breakthrough Thinking
Innovation-Driven Development
Continuous Growth
Lifelong Learning
Professional Development
Skill Acquisition
Emerging Technologies
Adaptability
Personal Growth
Multidisciplinary Collaboration
Interdisciplinary Research
Neuroscience
Computational Modeling
Cross-Domain Synergy
Systems Thinking
Collaborative Frameworks
Languages and Communication
Effective Storytelling
Technical Writing
Academic Presentations

References

Professor Yuki M. Asano
Professor at the University of Technology Nuremberg (UTN) [link]
Dr. Gertjan Burghouts
Researcher in deep learning and computer vision at the Netherlands Organisation for Applied Scientific Research (TNO) [link]

Projects

  • 2023.09 - 2024.08
    Open Hummingbird Evaluation
    Developed and published the Dense Nearest Neighbor Retrieval Evaluation (Balaˇzevi'c et al. "Towards In-contextScene Understanding") for testing the In-Context Learning Capabilities of vision encoders.
    • Deep Learning
    • Vision Encoders
    • Self-Supervised Learning
    • Nearest Neighbor Retrieval
    • In-Context Learning
    • Evaluation Metrics
  • 2023.04 - 2024.06
    Evaluating the Robustness of 3D Occupancy Prediction Models Under Noisy Data Conditions
    This project focused on evaluating the robustness and reliability of State-of-the-Art 3D Occupancy Prediction Models under noisy or corrupted input data, simulated using synthetic data. The study highlighted the inadequacy of training models with data that does not reflect real-world scenarios. It demonstrated significant improvements in performance when applying appropriate data augmentations, achieving a 1-3% increase on noisy datasets and a 0.5-1% increase on clean datasets, further advancing state-of-the-art results.
    • Computer Vision
    • 3D Occupancy Prediction
    • 3D Object Detection
    • Synthetic Data
    • Data Augmentation
    • Robustness Evaluation
    • Deep Learning
  • 2023.04 - 2024.06
    Iterative Image Refinement Using Socratic Models: Exploring Bias and Hallucination Effects
    This project involved designing and implementing an iterative approach inspired by Socratic Models, which uses a pipeline of models to iteratively refine a prompt and generate improved images. The process involves leveraging a history of generated images and their refined captions (via a language model). The study revealed the bias and hallucination effects that emerge when connecting multiple models, highlighting challenges in model interactions and data generation.
    • Deep Learning
    • Image Generation
    • Bias Detection
    • Hallucination Detection
    • Socratic Models
    • Iterative Refinement
    • Language Models