mse loss in pytorch geometric gives nan for loss function

Question

Asked by Md Tahmid Hasan Fuad on 25 December 2022 at 04:29

I am working on a regression problem using a GCN with PyTorch Geometric, and I get a NaN loss when using MSE loss, even though the output tensor contains no NaN values. Here is my model:

import torch
import torch.nn.functional as F
from torch_geometric.nn import GCNConv


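class GCN(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # `data` is a global torch_geometric Data object defined elsewhere.
        self.conv1 = GCNConv(data.num_node_features, 100)
        self.conv2 = GCNConv(100, 16)
        self.conv3 = GCNConv(16, data.num_node_features)
        # The hard-coded 104 only matches conv3's output if
        # data.num_node_features == 104.
        self.linear1 = torch.nn.Linear(104,1)

    def forward(self, data):
        x, edge_index = data.x, data.edge_index

        h = self.conv1(x, edge_index)
        h = F.relu(h)
        h = F.dropout(h, training=self.training)
        # No activation is applied between conv2, conv3 and linear1.
        h = self.conv2(h, edge_index)
        h = self.conv3(h, edge_index)
        h = self.linear1(h)
        # tanh bounds every prediction to the range (-1, 1).
        h = h.tanh()
        return h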

Here is the loop that calls the model and computes the loss:

import torch.nn as nn
device = torch.device('cpu')
model = GCN().to(device)
model = model.double()
data = data.to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01, weight_decay=5e-4)
model.train()

for epoch in range(3):
    optimizer.zero_grad()
    out = model(data)
    loss = F.mse_loss(out.squeeze(), data.y.squeeze())
    loss.backward()
    optimizer.step()
    print(f'Epoch: {epoch}, Loss: {loss}')

I also tried removing tanh and using softmax in the forward pass instead. The out tensor contains no NaN values; I printed it to check, and I also confirmed that data.y and out have the same length.

As I am new to GCNs and PyTorch Geometric, I have been unable to solve this problem.

Original Q&A

There is 1 answer

**Md Tahmid Hasan Fuad** · Accepted Answer · 2022-12-29T04:39:33+00:00

A NaN loss can have one of the following causes:

The dataset contains NaN values.
A network using ReLU sometimes produces NaN output, for example when activations grow without bound. (Try leaky ReLU instead.)
torch.sqrt receives zero as input. The forward result is 0, but the gradient there is infinite and can turn into NaN in the backward pass.
The wrong loss is used (e.g. a classification loss in a regression problem).
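
Two of the causes listed above can be checked directly. The following is a minimal sketch, not the asker's actual code: the helper name report_non_finite and the small tensors x and y are hypothetical stand-ins for data.x and data.y from the question. It counts NaN/Inf entries in the inputs and targets, and then shows how a square root at zero can yield a NaN gradient even though its forward value is finite.

```python
import torch


def report_non_finite(name, tensor):
    """Print and return the number of NaN/Inf entries in `tensor`."""
    bad = ~torch.isfinite(tensor)
    count = int(bad.sum())
    print(f"{name}: {count} non-finite of {tensor.numel()} values")
    return count


# Hypothetical stand-ins for data.x and data.y (assumption, not real data).
x = torch.tensor([[1.0, 2.0], [float("nan"), 4.0]], dtype=torch.double)
y = torch.tensor([0.5, 1.5], dtype=torch.double)

bad_x = report_non_finite("x", x)  # one NaN was planted in x
bad_y = report_non_finite("y", y)  # y is clean

# sqrt(0) is 0 in the forward pass, but its derivative at 0 is infinite.
# With a zero upstream gradient, the backward pass computes 0 / 0 = NaN.
z = torch.zeros(1, dtype=torch.double, requires_grad=True)
out = torch.sqrt(z) * 0.0
out.backward()
sqrt_grad = z.grad
print("gradient of sqrt at zero:", sqrt_grad)
```

In a real run, the same helper could be applied to data.x, data.y and the model output before computing the loss. If the inputs are clean, enabling torch.autograd.set_detect_anomaly(True) before training may help locate the backward operation that first produces NaN, at the cost of slower execution.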
