Saif Zaidan

My Name is Saif Zaidan

Machine Learning Engineer.

Enthusiastic about AI, programming, and constructing intelligent systems aimed at enhancing human experiences.

Resume

About Me


I am a graduate in Data Science and Artificial Intelligence from Princess Sumaya University for Technology.
My passion for comprehending intelligence and its real-world applications in AI is boundless.
I hold a profound interest in the field of Reinforcement Learning
and LLMS. Presently, I am employed at Samsung Research & Development in Jordan.
                
• Recent Skills and Technologies:

ML Research.
Problem Solving.
Transformers.
Pytorch.
C++/Python/Java.
Tensorflow.
Django.
HuggingFace/Bits&bytes.

Projects

Pothole Detection & Severity estimitation.

My solution for the SDIAI "The Smart Cities Challenge", realtime pothole detection with severity estimation based on depth and location.

Notebook Demo

Smart Autonomous grasping Robot (Graduation Project).

Smart Autonomous Grasping Robot using Pepper Robot A robotic system designed for grasping various types of litter, including soda cans, paper cups, and water bottles. It operates within a simulated environment using PyBullet, leveraging reinforcement learning for motion control and computer vision for litter classification.

Transformer Implementation From Scratch.

Implemenation of Attention is all you need PAPER Using Pytorch.

Sudoku Game With puzzle Generation.

Basic Sudoku Game implemented in python using pygame which generates puzzle using backtracking.

Game Engine built from scratch using C++ & Open GL

C++ Graphics Renderer from scratch including camera, lights, objects and shaders.

finger control system using computer vision.

Enhancing finger control through hand pose estimation and hand landmark detection enables real-time interaction with the screen using only hand gestures. This technology allows for intuitive and precise manipulation of digital interfaces, offering a seamless and immersive user experience.

Gym Database Managment System.

a Gym DataBase managent system implemented in python using sqlite.

Experience

Samsung Research
Sitech
AL-Ghanem Group

Machine Learning Engineer. - Samsung Research

Aug 2023 - Present

Samsung Research & Development

Researched the space of arabic nlp, llms and transformers and their on device applications and challenges regarding memeory and latency.
Quantization (bits&bytes), hugging face, vector DB’s and Rags.
contributed and developed ASR, TTS and ITN systems for arabic language.
delvoloped web Application for Data Managment and tracking using Django.
Rule Based ARABIC Inverse Text Information using java.
ARABIC Text-to-Speech system using C++.
ARABIC Automatic Speech Recognition models using tensorflow/pytorch (Cascaded Conformer).

Data Science intern - Sitech

July 2021 - Oct 2021

• Worked on multiple projects during this internship:

1) Topic Modeling:
This project aims to find the set of topics that best describe a document.

• Data: used the ABC NEWS dataset, it contains data of news headlines
published over a period of 14 years.
• Algorithms:
• Stemming and Lemmatization as a text preprocessing step.
• TF-IDF.
• LDA for classifying Documents.

2) Movie Rating Prediction:
This project aims to predict the Rating of a movie based on a set of attributes.

• Data : used the Netflix Movies and TV Shows This dataset consists of listings of all
the movies and tv shows available on Netflix, long with details such as - cast, directors, ratings, release year, duration, etc.
• Algorithms :
• Data Cleaning, Feature engineering and Preprocessing.
• Logistic Regression & Linear Regression.

3) House Price Prediction competition:
This project aims to predict Price of a house based on a set of attributes.

• Data : used the Housing Prices with 79 explanatory variables describing (almost) every aspect of
residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.
• Algorithms & Code :
• Data Cleaning, Feature engineering and Preprocessing.
• XGBoost.

Me :)

Data Engineering intern - AL-Ghanem Group

May 2022 - Dec 2022

This internship was focused to help solve a large data problem that faced the company,
Problem Description:
the company had more than 100k records of companies that had a lot of attributes for each company
including name,domain,contacts, etc.
but those records are duplicated in a fuzzy way, in which there are multiple names for the same company
because of data entry and old company policy issues.
Hence, the task was to reduce the number of duplicated records based on name similarity and data analysis process.

Solution:
• Technologies:
⚬ Python
⚬ Sql

• Approach:
⚬ first i used fuzzy matching algorithms within the fuzzy-wuzzy library to find companies based
on their name similarity using a python script.

⚬ then i went deeper into data analysis using Pandas library to find another attributes which can contribute to finding
duplicate companies, like: domain name, region, email correspondence, etc.

⚬ after that i classified companies into:
▬ Deleted : company records that are not necessary.
▬ Merged : company records that need to be merged with their fuzzy duplicate.
then we made stored procedures in sql to handle these companies.

⚬ By using pyodbc library i was able to connect to the company's live sql server, and with a python
script i was able to solve the company's data problems and retain a better and more valuable database for the comapany.

Achievements

Education & certificates

2019 - present

BSc of Data Science & AI

Princess Sumaya university for technology, Amman

  
    • Extracurricular Activities:

      ⚬ Former member of PSUT's Data Science club:
         Helped in organizing and Giving Workshops regarding various topics: 
                python, Kaggle competitions and jupyter-notebook analysis.

      ⚬ organizer for the AI–Ability initiative by PSUT which taught 
          foundational ML to school students in Jordan.
      
      ⚬ Participation in IEEEXtreme 16.0. Certificate

      ⚬ Calculus one instructor. 

      ⚬ Ana-Ushark initiative volunteer.

April-2022

TensorFlow Developer Professional Certificate

Tensorflow

▬  Certificate

March-2022

Deep Learning Specialization

DeepLearning.AI

  
⚬ This course is taught by Andew-ng, an adjunct professor at Stanford University; founder & CEO of
    DeepLearning.AI.

⚬ It consists of the following 5 seperate courses (which, combined, take about 180 hours to complete):
        
        1) Neural Networks and Deep Learning
        2) Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
        3) Structuring Machine Learning Projects
        4) Convolutional Neural Networks
        5) Sequence Models
  
⚬   Certificate

Get in Touch

My inbox is always open. Whether you have a question or just want to say hi, I’ll try my best to get back to you!

Mail me