AI EXPRESS
  • AI
    In tech, reliability comes before innovation

    In tech, reliability comes before innovation

    Google Cloud expands contact center automation offerings with third-party integrations

    New threat intelligence solution launched by Google Cloud  

    Intel VP talks AI strategy as company takes on Nvidia

    Intel VP talks AI strategy as company takes on Nvidia

    Cloudera

    Cloudera claims its new ‘all-in-one’ data lakehouse cuts ownership costs by up to 35%

    DARPA seeks AI solutions for sourcing critical minerals

    DARPA seeks AI solutions for sourcing critical minerals

    Who owns DALL-E images? Legal AI experts weigh in

    Who owns DALL-E images? Legal AI experts weigh in

  • ML
    AWS Localization uses Amazon Translate to scale localization

    AWS Localization uses Amazon Translate to scale localization

    Announcing the launch of the model copy feature for Amazon Rekognition Custom Labels

    Announcing the launch of the model copy feature for Amazon Rekognition Custom Labels

    Use deep learning frameworks natively in Amazon SageMaker Processing

    Intelligent document processing with AWS AI services: Part 2

    Customize your recommendations by promoting specific items using business rules with Amazon Personalize

    Customize your recommendations by promoting specific items using business rules with Amazon Personalize

    Amazon SageMaker JumpStart solutions now support custom IAM role settings

    Amazon SageMaker JumpStart solutions now support custom IAM role settings

    Amazon SageMaker Automatic Model Tuning now supports SageMaker Training Instance Fallbacks

    Amazon SageMaker Automatic Model Tuning now supports SageMaker Training Instance Fallbacks

    Build a GNN-based real-time fraud detection solution using Amazon SageMaker, Amazon Neptune, and the Deep Graph Library

    Build a GNN-based real-time fraud detection solution using Amazon SageMaker, Amazon Neptune, and the Deep Graph Library

    Build an air quality anomaly detector using Amazon Lookout for Metrics

    Build an air quality anomaly detector using Amazon Lookout for Metrics

    Use computer vision to measure agriculture yield with Amazon Rekognition Custom Labels

    Use computer vision to measure agriculture yield with Amazon Rekognition Custom Labels

  • NLP
    Central Park in New York City

    Researchers Track Twitter to Learn What People Value in New York City Parks

    A Little More Conversation: How Programmatic Is Driving Podcast Ad Innovation

    A Little More Conversation: How Programmatic Is Driving Podcast Ad Innovation

    Historical costume descriptors bridge gap between past and present | VTx

    Historical costume descriptors bridge gap between past and present | VTx

    Maritime industry to spend $931 mln on AI solutions in 2022

    Maritime industry to spend $931 mln on AI solutions in 2022

    IonQ Announces Second Quarter 2022 Financial Results

    PatientMetRx Patient Opinion Map

    Through a glass, (more) clearly – PharmaLive

    Researchers Develop DL-GuesS: A Deep Learning and Sentiment Analysis-Based Framework For Cryptocurrency Price Prediction

    Researchers Develop DL-GuesS: A Deep Learning and Sentiment Analysis-Based Framework For Cryptocurrency Price Prediction

    Busting homophobic, anti-queer bias in AI language models

    Busting homophobic, anti-queer bias in AI language models

    IATA Launches Online Platform to Help Identify Security Risks

    IATA Launches Online Platform to Help Identify Security Risks

  • Vision
    Seoul Robotics Helps Cars Move, Park on Their Own

    Seoul Robotics Helps Cars Move, Park on Their Own

    Pattern Recognition With Geometric Model Finder

    Pattern Recognition With Geometric Model Finder

    The Top 10 Applications of Computer Vision in Aviation

    The Top 10 Applications of Computer Vision in Aviation

    YOLOv7: The Fastest Object Detection Algorithm (2022)

    YOLOv7: The Fastest Object Detection Algorithm (2022)

    Progressive Growing Generative Adversarial Networks

    Progressive Growing Generative Adversarial Networks

    Deep Learning for Image Dehazing- The What, Why, and How

    Deep Learning for Image Dehazing- The What, Why, and How

    How to train and use a custom YOLOv7 model

    How to train and use a custom YOLOv7 model

    viso.ai Logo

    Deep Learning for Person Re-Identification (2022)

    NVIDIA Jetson AGX Orin 32GB Production Modules Now Available; Partner Ecosystem Appliances and Servers Arrive

    NVIDIA Jetson AGX Orin 32GB Production Modules Now Available; Partner Ecosystem Appliances and Servers Arrive

  • Robotics
    zoox taxi

    How Zoox robotaxis make predictions while on the road

    seoul robotics

    Seoul Robotics makes regular cars autonomous with LV5 CTRL TWR

    scan&sand

    GrayMatter Robotics’ sanding solution brings in $20M

    pitchfire

    Pitchfire startup competition submissions open

    sprout

    Muddy Machines brings in $1.8M for asparagus harvesting robot Sprout

    Levita Magnetics raises $26M for Magnetic-Assisted Robotic Surgery platform

    Levita Magnetics raises $26M for Magnetic-Assisted Robotic Surgery platform

    Marc Raibert Atlas dancing

    Hyundai launches Boston Dynamics AI Institute

    programmable material

    MIT CSAIL creates materials that can sense the way they move

    robotics investments and business opportunities

    The state of robotics investment

  • RPA
    How to Create a Rock Solid Technology Portfolio with Hyperautomation?| AutomationEdge

    How to Create a Rock Solid Technology Portfolio with Hyperautomation?| AutomationEdge

    Unlocking the Top Healthcare Automation Trends with Use Cases that Rule the World| AutomationEdge

    Unlocking the Top Healthcare Automation Trends with Use Cases that Rule the World| AutomationEdge

    Staying Ahead of the Time with AI-Powered Customer Experience

    Staying Ahead of the Time with AI-Powered Customer Experience| AutomationEdge

    Why is Developing Decision Intelligence with AI Support Crucial in Healthcare?

    Why is Developing Decision Intelligence with AI Support Crucial in Healthcare?

    Robotic Process Automation using Blue Prism

    Robotic Process Automation using Blue Prism

    AI- The Tech Medicine Ameliorating the Healthcare Industry?

    AI- The Tech Medicine Ameliorating the Healthcare Industry?| AutomationEdge

    Take employee experience into hyperdrive with Hyperautomation

    Hyperautomation- Your Answer to Enhance Employee Experience| AutomationEdge

    Know Why Automation Now Resides in the Heart of Customer Contact Centers| AutomationEdge

    Know Why Automation Now Resides in the Heart of Customer Contact Centers| AutomationEdge

    Conversational AI, Healing the Healthcare Industry| AutomationEdge

    Conversational AI, Healing the Healthcare Industry| AutomationEdge

  • Gaming
    Images from the set of Amazon's Fallout series have leaked

    Images from the set of Amazon’s Fallout series have leaked

    Random: Kirby Café Rustles Up A New Dessert To Celebrate Kirby's Dream Buffet

    Random: Kirby Café Rustles Up A New Dessert To Celebrate Kirby’s Dream Buffet

    Dying Light 2's first story DLC, Bloody Ties, to be revealed at gamescom Opening Night Live

    Dying Light 2’s first story DLC, Bloody Ties, to be revealed at gamescom Opening Night Live

    Why 4 million people – and counting – are flocking to Korea’s hottest battle royale

    Why 4 million people – and counting – are flocking to Korea’s hottest battle royale

    Saints Row trailer gives you a taste of the story

    Saints Row trailer gives you a taste of the story

    Advance Wars 1+2: Re-Boot Camp Scheduled Maintenance Spotted

    Advance Wars 1+2: Re-Boot Camp Scheduled Maintenance Spotted

    MultiVersus datamine suggests Beetlejuice and Oz’s Wicked Witch of the West are coming to the game

    MultiVersus datamine suggests Beetlejuice and Oz’s Wicked Witch of the West are coming to the game

    Call of Duty: Modern Warfare 2 open beta and multiplayer reveal dated

    Call of Duty: Modern Warfare 2 open beta and multiplayer reveal dated

    Xbox Game Pass is losing some brilliant games soon

    Xbox Game Pass is losing some brilliant games soon

  • Investment
    MiAlgae Raises £2.3M in Funding

    MiAlgae Raises £2.3M in Funding

    Skedda Receives Investment From Five Elms Capital

    Skedda Receives Investment From Five Elms Capital

    Stage 2 Capital

    Stage 2 Capital Closes $150M Fund III

    Shima Capital Closes $200M Maiden Fund to Support Web3 Founders

    Shima Capital Closes $200M Maiden Fund to Support Web3 Founders

    accenture

    Accenture Completes Acquisition of The Stable

    IsoTalent Raises USD5M in Seed Funding

    IsoTalent Raises USD5M in Seed Funding

    Arine Raises $29M in Series B Funding

    Arine Raises $29M in Series B Funding

    threatx

    ThreatX Raises $30M in Series B Funding

    VidMob Raises $110M in Series D Funding Round

    VidMob Raises $110M in Series D Funding Round

  • More
    • Data analytics
    • Apps
    • No Code
    • Cloud
    • Quantum Computing
    • Security
    • AR & VR
    • Esports
    • IOT
    • Smart Home
    • Smart City
    • Crypto Currency
    • Blockchain
    • Reviews
    • Video
No Result
View All Result
AI EXPRESS
No Result
View All Result
Home Computer Vision

Face Generation with GANs

seprameen by seprameen
December 23, 2021
in Computer Vision
0
Face Generation with GANs
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter
Picture by Timon Klauser / Unsplash

Within the trendy period of excessive computational energy and superior technological developments, we’re in a position to obtain most of the complicated duties that may as soon as be deemed not possible. Generative networks are one of many strongest speaking factors of the earlier decade as a result of gravity of the fabulous outcomes that they’ve been in a position to generate efficiently. One of many duties that we are going to focus on on this article, which was additionally as soon as thought of fairly complicated to realize, is the era of life like faces with the assistance of generative adversarial networks. There have been a number of profitable deployments of fashions and totally different neural architectural builds which have managed to perform this process successfully.

This text will give attention to understanding a number of the fundamental ideas of facial era with the assistance of generative adversarial networks. We are going to attempt to accomplish the era of life like photographs with the assistance of Deep Convolutional Generative Adversarial Networks (DCGANs) that we constructed in one in all our earlier works. Earlier than shifting additional with this text, I’d suggest testing two of my previous works to remain up to date with the contents of this text. To get began with GANs, take a look at the next hyperlink – “Full Information to Generative Adversarial Networks (GANs)” and to achieve additional data on DCGANs, take a look at the next hyperlink – “Getting Began With DCGANs.”

To grasp the quite a few core ideas that we are going to cowl on this article, take a look at the desk of contents. We are going to begin off with an introduction to the era of facial buildings with GANs after which proceed to achieve additional data on a number of the pre-existing methods which were efficiently deployed to realize one of the best outcomes doable for these duties. We could have a fast overview of the DCGANs structure and methodologies to strategy the next process. After which have a code walkthrough to debate the varied steps to observe to realize a fascinating consequence. Lastly, we are going to conclude with some much-needed sections of the long run works and the forms of enhancements and developments that we will make to the constructed structure mannequin.

Introduction:

The expansion of Generative Adversarial Networks (GANs) is fast. The continual developments in these twin neural community architectures, consisting of a generator mannequin and discriminator, assist to stabilize outputs and generate actual photographs, which turn into virtually subsequent to not possible for the human eye to distinguish. The above 1024×1024 sized picture exhibits the image of a cheerful girl. Wanting on the following picture, it might come as a shock to folks new to this spectrum of deep studying that the particular person doesn’t really exist. It was generated from the web site referred to as this X does not exist, and it’s also possible to generate random faces by yourself by refreshing the next link. Every time the generator is run, a totally new life like face is generated.

After I requested a number of of my mates to find out what they thought in regards to the explicit image, virtually no person was ready to determine that the next picture was generated with the assistance of Synthetic Intelligence (AI). To be exact, the StyleGAN2 structure used to generate the next faces performs at an distinctive stage producing these outcomes indistinguishable from actual images. Solely upon additional introspection will you be capable of discover slight abnormalities within the following picture depicted above. One of many noticeable points might be the bizarre shaded-out background. One other situation might be the distinction within the two earrings which can be worn by her, but it surely may be thought of as an attention-grabbing style sense. Small errors like these are nonetheless troublesome for

It’s enjoyable to take a position how the generated picture might not be actual, however the easy reality we’re having the next dialogue is sufficient proof that the potential for generative networks for a process like facial construction era or another comparable mission is humungous and one thing to be careful for within the upcoming future. We are going to discover the big potential that these generative networks must create life like faces or another sort of photographs from scratch. The idea of two neural networks competing in opposition to one another to provide the best outcomes is an intriguing facet of research in deep studying.

Allow us to begin by understanding a number of the methodologies and pre-existing methods which can be at present used within the era of faces. As soon as we discover these pre-existing methods, we are going to go into better element on how we will produce these comparable outcomes with the assistance of deep convolutional generative adversarial networks. So, with out additional ado, allow us to proceed to know and discover these matters accordingly.


Understanding some pre-existing methods:

Image Source

The development of the expertise of GANs has been fast-faced and very profitable in a brief span of lower than a decade. An journey began out in 2014 with a easy Generative Adversarial Structure quickly discovered steady reputation by means of the quite a few years of developments and enhancements. There are tons of recent methods and methodologies which can be always being found by builders and researchers on this discipline of generative neural networks. The above picture is a sign of the fast tempo of improvement that’s achieved by the advance within the total expertise and strategies utilized in Generative Adversarial Networks. Allow us to briefly focus on a number of the implausible methods which can be at present deployed on the earth of machine studying for the aim of life like face era.

One of many main architectures that did a implausible job of engaging in this process of producing photorealistic faces from scratch was the StyleGAN structure developed by NVIDIA. Whereas these preliminary fashions produced fabulous outcomes to start out with, it was within the second iteration of the StyleGAN architectures that these fashions gained immense reputation. They mounted a number of the attribute artifacts options after thorough analysis, evaluation, and experiments. In 2019, this new model of Fashion GANs 2 was launched with a complicated mannequin structure and a number of other strategies of enchancment of their coaching process to realize one of the best outcomes until that point. To grasp and study extra about this explicit structure, I’d counsel trying on the following research paper. Lately, the third iteration of the StyleGAN structure was launched with improved outcomes and additional developments on the kind of duties it’s sometimes constructed to carry out.

One other intriguing idea is of the Face App that performs adjustments in the kind of age of an individual by altering the faces can be constructed with the assistance of a GAN. Sometimes to carry out a lot of these duties, a Cycle GAN structure is used, which has gained immense reputation as of late. These GANs carry out the motion of studying the transformation of varied photographs of various types. The Face App, for instance, transforms the generated or uploaded faces into faces of various age teams. We are going to discover each the Cycle GAN and the Fashion in future works, however for the aim of this text, we are going to keep on with our beforehand mentioned DCGANs structure to realize the duty of producing faces.


Producing Faces With DCGANs:

On this part of the article, we are going to additional discover the idea of DCGANs and find out how to generate quite a few faces with the assistance of this structure. In one in all my earlier articles on DCGANs, we’ve got lined a lot of the important ideas required for understanding and fixing a typical process with the assistance of those generative networks. Therefore, we could have a short overview of the kind of mannequin we are going to make the most of for fixing the duty of face era and reaching one of the best outcomes. Allow us to get began with the exploration of those ideas.

Overview of DCGANs:

Whereas the essential or vanilla Generative Adversarial Networks produced a number of the greatest works within the early days of generative networks, these networks weren’t extremely sustainable for extra complicated duties and weren’t as computationally efficient for performing sure operations. For this function, we use a wide range of generative community architectures which can be constructed for particular functions and carry out a specific motion extra successfully than others. On this article, we are going to make the most of the DCGANs mannequin with a deep convolutional generator structure to generate photographs. The generator with a random latent noise vector will generate the facial buildings, and the discriminator, which acts as a picture classifier, will distinguish if the output picture is actual or faux.

See also  Startup Delivers AI to Create Smart Supermarket

Whereas the working process of the deep convolutional generative adversarial networks is much like the workings of a typical GAN sort community, there are some key architectural enhancements that had been developed to provide more practical outcomes on most datasets. Many of the pooling layers had been changed with strided convolutions within the discriminator mannequin and fractional-strided convolutions within the case of the generator fashions. The usage of batch normalization layers is extensively utilized in each the generator and discriminator fashions for acquiring the soundness of the general structure. Many of the later dense layers, i.e., the absolutely linked layers, had been eliminated for a completely convolutional sort structure.

All of the convolutional layers within the discriminator mannequin are adopted by the Leaky ReLU activation operate aside from the ultimate layer with Sigmoid for classification functions. The generator mannequin makes use of the ReLU activation operate in all layers besides the ultimate layer, which makes use of the tanh operate for generative functions. Nonetheless, it’s noticeable that we’ve got made sure small adjustments in our generator structure as these are some minute alterations in trendy time which have proven some higher leads to particular use instances. The viewers and builders can be happy to check out their very own variations or keep on with the default archetypes for his or her respective initiatives in response to their necessities.

Process To Observe:

The process we are going to observe for the efficient development of this mission is to make sure that we accumulate one of the best datasets for the required process. Our choices are to make the most of high-quality datasets which can be obtainable on the web or keep on with another alternate options for performing these duties. We are going to make the most of the Celeb Faces Attributes (CelebA) Dataset to develop our facial recognition generative networks. The next dataset is on the market on Kaggle, and it is strongly recommended that you simply obtain it to proceed with the remaining contents of the article. You probably have a complicated system with the sources to compute high-end issues, then you’ll be able to check out the Celeb HQ datasets. Additionally, do not forget that the Gradient platform on Paperspace is likely one of the greatest choices for these in depth GPU-related duties.

As soon as we’ve got the dataset downloaded, be sure that you extract the img align celeba zip file within the “Pictures” listing to proceed with the following steps. We are going to make the most of the TensorFlow and Keras deep studying frameworks for this mission. If you do not have full data of those two libraries or need to refresh your fundamentals shortly, I’d suggest testing the next hyperlink for TensorFlow and this explicit hyperlink for Keras. As soon as we entry the dataset, we are going to assemble each the generator and the discriminator fashions for performing the duty of face era. The complete architectural construct will give attention to creating the adversarial framework for producing the picture and classifying probably the most life like interpretations. After the development of your complete architectural construct, we are going to begin the coaching process and save the pictures accordingly. Lastly, we are going to save the generator and discriminator fashions in order that we will reuse them another time.


Code walkthrough:

On this part of the article, we are going to discover find out how to code our face generator mannequin from scratch in order that we will make the most of these generator fashions to acquire high-quality outcomes much like a number of the state-of-the-art strategies. It’s essential to notice that on this mission, we are going to make the most of sizes of lesser dimensions (64×64) so that individuals in any respect ranges can assemble this work and will not must endure from the impediment of graphics limitations. The Gradient platform offered by Paperspace is likely one of the greatest utility choices for these complicated computational initiatives. You probably have higher gear or need to produce better-looking outcomes, then I’d suggest attempting out extra HD datasets which can be obtainable on the web. Be happy to discover the choices and select one of the best ones accordingly. For the aim of this text, we are going to stick to a easy smaller dimensional dataset that doesn’t eat an excessive amount of area. Allow us to get began with the development of this mission.

Convey this mission to life

Importing the important libraries:

The beginning step of most initiatives is to import all of the important libraries that we are going to make the most of for the event of the required mission. For the development of this face era mannequin, we are going to make the most of the TensorFlow and Keras deep studying frameworks for reaching our objectives. If you’re snug with PyTorch, that may even be one other legitimate choice to develop the mission as you need. When you aren’t acquainted with TensorFlow or Keras, take a look at the next two articles that can information you intimately on how one can get began with these libraries – “The Absolute Guide to TensorFlow” and “The Absolute Guide to Keras.” Apart from these fabulous frameworks, we will also utilize other essential libraries, such as matplotlib, numpy, and OpenCV, for performing visualizations, computations, and graphics-related operations accordingly.

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers
import numpy as np
import matplotlib.pyplot as plt
import os
import cv2
from tqdm import tqdm

You can feel free to utilize other libraries that you might deem necessary or suitable to perform the following task. You can also add numerous options of graphics tools or visualization graphs to track your progress and ultimately help you in improving the results. Once we have successfully imported all the required libraries that we need, we can proceed to explore the data and construct our model for solving the following task.

Exploring the data:

In this step, we will load the data into a Dataset format and map the details accordingly. The pre-processing class in the Keras allows us to access the flowing of data from a particular directory so that we can access all the images stored in the particular folder. Ensure that once you extract the celeb dataset containing over 200000 images, you place the extracted directory into another directory titled “Images” or any other folder name of your choice. The dataset will flow from the following directory, and we will assign the image size of 64 x 64 for the computation of the following data.

dataset = keras.preprocessing.image_dataset_from_directory(
    "Images", label_mode=None, image_size=(64, 64), batch_size=32
)

dataset = dataset.map(lambda x: x / 255.0)

The successful execution of the above code cell block should show a result similar to the following statement – “Found 202599 files belonging to 1 classes.” You can choose to randomly display an image to check if you have loaded the dataset precisely as planned. The code block below is one of the ways of visualizing your data.

for i in dataset:
    plt.axis("off")
    plt.imshow((i.numpy() * 255).astype("int32")[0])
    break

With the standard exploration of data and combining them effectively into your code, you can proceed to construct the discriminator and generator blocks for the computation of the face generation. Let us cover these in the upcoming sections of the article.

Constructing the Discriminator Model:

Firstly, we will construct the discriminator model for the generative network to classify the images as real or fake. The model architecture of the discriminator is exactly as discussed in the previous section. We will use the image shape of (64, 64, 3), which is the same size as the original images. The leaky ReLU will follow after each convolutional layer of varying filter sizes. Finally, after three blocks of the following combination, we will add a flatten layer, a dropout layer, and finally add the Dense layer with the sigmoid activation function to make the predictions of 0 or 1, for distinguishing between either real or fake images.

discriminator = keras.Sequential(
    [
        keras.Input(shape=(64, 64, 3)),
        layers.Conv2D(64, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Conv2D(128, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Conv2D(128, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Flatten(),
        layers.Dropout(0.2),
        layers.Dense(1, activation="sigmoid"),
    ],
    name="discriminator",
)

Note that if you are performing the following activities for different image sizes, especially for a higher resolution one, you will need to modify some of the parameters, such as the filter size, for a more precise result. However, note that there might be several GPU limitations, and you might encounter a resource exhausted error depending on the type of machine you use on your local system. The Gradient platform on Paperspace is a great option worth considering for higher resolution tasks.

See also  Achieve 35% faster training with Hugging Face Deep Learning Containers on Amazon SageMaker

Constructing the Generator Model:

The generator model of the DCGANs will be built, similarly as discussed in the overview section of this article. A few changes we will make are the modifications in the general activation function that are utilized in the generator model. Note that the ReLU functions suggested in the paper are replaced by the Leaky ReLU, similar to the ones that we used in the discriminator, and the final tanh function is replaced with the sigmoid activation. If you are looking at a higher quality of images to construct the project (like 512×512 or 1024×1024), ensure that you make the essential changes in the deconvolutional layer strides accordingly to suit the particular architecture.

latent_dim = 128

generator = keras.Sequential(
    [
        keras.Input(shape=(latent_dim,)),
        layers.Dense(8 * 8 * 128),
        layers.Reshape((8, 8, 128)),
        layers.Conv2DTranspose(128, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Conv2DTranspose(256, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Conv2DTranspose(512, kernel_size=4, strides=2, padding="same"),
        layers.LeakyReLU(alpha=0.2),
        layers.Conv2D(3, kernel_size=5, padding="same", activation="sigmoid"),
    ],
    name="generator",
)

Once you have created the generator and discriminator models, we can begin the training process of the GANs. We will also generate the images at intervals to ensure that we are receiving a brief idea of the performance of our architectural build.

Training and Generating Images:

The final step of constructing our project is to train the models for a large number of epochs. Note that I have used only five epochs for the following train and saved an image at an interval of 100. The Gradient Tape function will automatically compile both the models accordingly, and the construction phase for the following project can be started. Keep training the model until you are satisfied with the results produced by the generator.

for epoch in range(5):
    for index, real in enumerate(tqdm(dataset)):
        batch_size = real.shape[0]
        vectors = tf.random.normal(shape = (batch_size, latent_dim))
        fake = generator(vectors)
        
        if index % 100 == 0:
            Image = keras.preprocessing.image.img_to_array(fake[0])
#             path = (f"My_Images/images{epoch}_{index}.png")
#             cv2.imwrite(path, image)
            keras.preprocessing.image.save_img(f"My_Images/images{epoch}_{index}.png", Image)
            
        with tf.GradientTape() as disc_tape:
            real_loss = loss_function(tf.ones((batch_size, 1)), discriminator(real))
            fake_loss = loss_function(tf.zeros(batch_size, 1), discriminator(fake))
            total_disc_loss = (real_loss + fake_loss)/2
            
        grads = disc_tape.gradient(total_disc_loss, discriminator.trainable_weights)
        discriminator_optimizer.apply_gradients(zip(grads, discriminator.trainable_weights))
        
        with tf.GradientTape() as gen_tape:
            fake = generator(vectors)
            generator_output = discriminator(fake)
            generator_loss = loss_function(tf.ones(batch_size, 1), generator_output)
            
        grads = gen_tape.gradient(generator_loss, generator.trainable_weights)
        generator_optimizer.apply_gradients(zip(grads, generator.trainable_weights))

Once your training procedure is complete, you can proceed to save the generator and the discriminator models in their respective variables either in the same directory or another directory of your creation. Note that you can also save these models during the training process of each epoch. If you have any system issues or other similar problems that might affect the complete computation of the training process, the best idea might be to save the models at the end of each iteration.

generator.save("gen/")
discriminator.save("disc/")

Feel free to perform other experiments on this project. Some of the changes I would recommend trying out is to construct a custom class for the GANs and build a tracker of discriminator and generator loss accordingly. You can also create your checkpoint systems to monitor the best results and save the generator and discriminator models. Once you are comfortable with the given image sizes, it is worth trying out higher resolutions for better-interpreted results. Let us now look at some of the possible future works that we can focus on for further developments in this sector of neural networks and deep learning.


Future works:

Face generation is a continuously evolving concept. Even with all the discoveries and the new modern technologies that have been developed in the field of generative adversarial networks, there are is an enormous potential to exceed far beyond the confines of simple face generations. In this section of the article, we will focus on the type of future works that are currently happening in the world of deep learning and what your next steps to progress in the art of face generation with generative networks and deep learning methodologies must be. Let us analyze a couple of ideas that will help to innovate and elevate our generative models to even further superior levels.

Further advancements and improvements to our model:

We know that the model that we currently built is capable of generating pretty realistic training images after a good few epochs of training, which could potentially take anywhere between several hours of training to a few days, depending on the type of system that you use. However, we can notice that even though the model performs quite well in generating a large number of faces, the amount of control that we possess upon the choice of generation is quite less as the generator model randomly generates a facial structure. A few major improvements that we can make to our models is to be able to construct them in a way such that a lot of parameters are under our control, and we have the option to stabilize the received outputs.

Some of the parameters and attributes that you can make adjustable are human traits such as gender, age, type of emotion, numerous poses of the head, ethnicity, toning of the skin, color of the hair, the type of accessories worn (such as a cap or sunglasses), and so much more. By making all these characters variable with the Generative Adversarial Network you construct, a lot of variations are possible. An example of performing such action is noticeable in the Face Generator web site, the place you’ll be able to generate virtually 11,232,000+ variants of the identical face with quite a few combos. We are able to additionally make such developments in our coaching mannequin to realize superior outcomes whereas having first rate management over the kind of facial buildings we select to generate.

Practical Deepfakes works:

In one of many earlier sections, we mentioned how the Face App can change the age of an individual from a particular vary with the assistance of generative adversarial networks, specifically utilizing Cycle GANs. To take these experiments one step additional can be to make the most of Deepfakes in our initiatives. By making efficient use of synthetic intelligence and machine studying, specifically utilizing generative networks comparable to autoencoders and generative adversarial networks, it’s doable to generate artificial media (AI-generated media). This highly effective approach can manipulate or generate faces, video content material, or audio content material, which is extraordinarily life like, and virtually not possible to distinguish if faux or not.

With trendy Deepfakes computations, you’ll be able to generate a face and make the identical generated face carry out quite a few actions comparable to an intriguing dialogue supply or a facial motion that really by no means existed. There may be huge scope for combining our generative neural community structure with different comparable applied sciences to create one thing completely fabulous and surprising. One of many future experiments value attempting out is to assemble a generative mannequin that may generate high-quality faces with excessive precision and realism. Utilizing this generated face and mixing it with Deepfakes expertise, numerous accomplishments and enjoyable initiatives are doable. Be happy to discover the varied combos and permutations that these matters provide the builders to work with accordingly.


Conclusion:

Lego heads
Picture by Jan Huber / Unsplash

The era of faces with distinctive precision and the life like show is likely one of the biggest accomplishments of generative adversarial networks. From deeming the next process of a facial era to be virtually not possible a number of a long time in the past, to producing a median facial era in 2014, to the continual development of those facial buildings that look so life like and fabulous is a humungous achievement. The chances for these generative neural networks to take over a large portion of deep studying and create new areas of research are huge. Therefore, it’s of utmost significance for contemporary deep studying researchers to start out experimenting and studying about these phenomenal mannequin architectures.

On this article, we understood the importance of generative adversarial networks on reaching a fancy process comparable to producing life like faces of people which have by no means really existed. We lined the subject of various pre-existing methods which can be constantly deployed within the trendy improvement section to realize the next process efficiently. We then studied a short overview of our earlier DCGANs structure and understood find out how to cope with face era with these networks. After an in depth code overview for the next, we understood the process of face era with DCGANs. Lastly, we explored the big potential that these networks have by gaining extra data in regards to the future developments and enhancements that we will make to those fashions.

Within the upcoming articles, we are going to additional discover the subject of generative adversarial networks and attempt to study the various kinds of GANs which were developed by means of the years. One of many subsequent works that we are going to discover will contain the utility of super-resolution generative adversarial networks (SRGANs). Till then, get pleasure from researching, programming, and growing!

Source link

Tags: faceGANsGeneration
Previous Post

Bitcoin, Ether, And Other Crypto Tokens Could See A Less Bumpy Ride In 2022

Next Post

Global Robotic Process Automation (RPA) Market

seprameen

seprameen

Next Post
Global Robotic Process Automation (RPA) Market

Global Robotic Process Automation (RPA) Market

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Newsletter

Popular Stories

  • Cilium launches eBPF-powered Kubernetes service mesh

    Don’t overengineer your cloud architecture

    0 shares
    Share 0 Tweet 0
  • DeepFace – Most Popular Deep Face Recognition in 2022 (Guide)

    0 shares
    Share 0 Tweet 0
  • How to train and use a custom YOLOv7 model

    0 shares
    Share 0 Tweet 0
  • YOLOv7: The Fastest Object Detection Algorithm (2022)

    0 shares
    Share 0 Tweet 0
  • LG TV Owners Can Get 90 Days Of Stadia Pro For Free

    0 shares
    Share 0 Tweet 0

Computer Vision Jobs

View 115 Vision Jobs at Tesla

View 165 Vision Jobs at Nvidia

View 105 Vision Jobs at Google

View 135 Vision Jobs at Amamzon

View 131 Vision Jobs at IBM

View 95 Vision Jobs at Microsoft

View 205 Vision Jobs at Meta

View 192 Vision Jobs at Intel

Accounting and Finance Hub

Raised Seed, Series A, B, C Funding Round

Get a Free Insurance Quote

Try Our Accounting Service

AI EXPRESS

AI EXPRESS is a news site that covers the latest developments in Artificial Intelligence, Data Analytics, ML & DL, Algorithms, RPA, NLP, Robotics, Smart Homes & Cities, Cloud & Quantum Computing, AR & VR and Blockchains

Categories

  • AI
  • Ai videos
  • Apps
  • AR & VR
  • Blockchain
  • Cloud
  • Computer Vision
  • Crypto Currency
  • Data analytics
  • Esports
  • Gaming
  • Gaming Videos
  • Investment
  • IOT
  • Iot Videos
  • Low Code No Code
  • Machine Learning
  • NLP
  • Quantum Computing
  • Robotics
  • Robotics Videos
  • RPA
  • Security
  • Smart City
  • Smart Home

Quick Links

  • Reviews
  • Deals
  • Best
  • AI Jobs
  • AI Events
  • AI Directory
  • Industries

© 2021 Aiexpress.io - All rights reserved.

  • Contact
  • Privacy Policy
  • Terms & Conditions

No Result
View All Result
  • AI
  • ML
  • NLP
  • Vision
  • Robotics
  • RPA
  • Gaming
  • Investment
  • More
    • Data analytics
    • Apps
    • No Code
    • Cloud
    • Quantum Computing
    • Security
    • AR & VR
    • Esports
    • IOT
    • Smart Home
    • Smart City
    • Crypto Currency
    • Blockchain
    • Reviews
    • Video

© 2021 Aiexpress.io - All rights reserved.