I am working on a deep learning model for "Age Invariant Face detection". I started by preprocessing the data: face cropping via MTCNN, image contrast correction, and sharpening. I then built the model architecture, a custom sequential CNN. The training and validation losses were both close to 1.9, but one epoch took approximately 8 hours.

I needed faster results, so I moved to transfer learning. I read a paper which said ResNet-18 performed well on this task, so I tried ResNet-18 first. I tuned it by varying the learning rate, but to no avail: accuracy wouldn't go beyond 40%. I then switched to ResNet-50, which I am currently working on. I tried fine-tuning it by changing the learning rate manually, tried different optimizers (Adam, SGD), and used learning rate decay. The model started to overfit, so I added dropout and image augmentation on the training data, and then weight_decay because it was still overfitting.

Right now there is no overfitting, but the losses aren't decreasing. I keep running experiments and reading many articles per day, but I feel stuck.
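For reference, the kind of learning rate decay mentioned above can be sketched in plain Python. This is only an illustration, not the exact schedule used in these experiments: the function names, the drop factor of 0.1, and the 10-epoch interval are assumptions chosen for the example.

```python
import math


def step_decay(base_lr, epoch, drop=0.1, every=10):
    """Multiply the learning rate by `drop` once every `every` epochs.

    `drop` and `every` are illustrative defaults, not values from the post.
    """
    return base_lr * (drop ** (epoch // every))


def cosine_decay(base_lr, epoch, total_epochs, min_lr=0.0):
    """Smoothly anneal from `base_lr` down to `min_lr` over `total_epochs`."""
    # Clamp so epochs past the end stay at min_lr instead of rising again.
    progress = min(epoch, total_epochs) / total_epochs
    return min_lr + 0.5 * (base_lr - min_lr) * (1 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Print both schedules side by side for a hypothetical 30-epoch run.
    for epoch in range(0, 31, 5):
        print(epoch, step_decay(0.1, epoch), cosine_decay(0.1, epoch, 30))
```

Printing the schedule like this makes it easy to check whether the learning rate has already decayed to a very small value by the time the loss stops moving, which is one possible reason for a plateau.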
This looks like a very good case for feature engineering. Try as much as you can to make your images cheaper to process by applying some classic feature engineering methods. Using MTCNN is a really good approach, and I would definitely look into feature engineering. Also, here is a link that could give you some "aha" moments performance-wise. All the best.
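As one possible reading of "make your images cheaper to process", here is a minimal sketch that converts an RGB image to grayscale and halves its resolution. It is an assumption about what such preprocessing could look like, not a method taken from the question; the function names are hypothetical, and images are represented as nested lists only to keep the example dependency-free (in practice an image library would do this).

```python
def to_grayscale(image):
    """Convert rows of (r, g, b) tuples to rows of luminance values.

    Uses the common ITU-R BT.601 luma weights.
    """
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in image]


def downsample_2x(gray):
    """Average each non-overlapping 2x2 block, halving height and width.

    A trailing odd row or column is dropped.
    """
    height = len(gray) // 2 * 2
    width = len(gray[0]) // 2 * 2
    return [
        [
            (gray[y][x] + gray[y][x + 1] + gray[y + 1][x] + gray[y + 1][x + 1]) / 4.0
            for x in range(0, width, 2)
        ]
        for y in range(0, height, 2)
    ]


if __name__ == "__main__":
    # A hypothetical 4x4 uniform gray image standing in for a cropped face.
    face = [[(100, 100, 100)] * 4 for _ in range(4)]
    small = downsample_2x(to_grayscale(face))
    print(len(small), len(small[0]))
```

Halving each side cuts the pixel count by four, which reduces the per-image cost of the convolutional layers roughly in proportion. Note that ImageNet-pretrained ResNets expect 3-channel input, so the grayscale step fits the custom CNN more naturally than the transfer learning setup.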