
How to implement a detach operation similar to PyTorch? #1138

Closed
zaoanhh opened this issue Dec 17, 2024 · 2 comments

Comments

zaoanhh commented Dec 17, 2024

I've recently been doing some unsupervised training with Lux, and I think Lux is pretty cool. But I ran into a problem: I use the model's output to generate labels for subsequent calculations. In PyTorch it looks like:

import torch
import torch.nn.functional as F
y = model(x)
y1 = y.detach()
softmax_y1 = F.softmax(y1, dim=1)
pred_class_indices = torch.argmax(softmax_y1, dim=1)
num_classes = 4
labels_true = F.one_hot(pred_class_indices, num_classes=num_classes)

In Lux, how should I implement the detach operation so that gradients are not tracked during label generation?

avik-pal (Member) commented:

You do ChainRulesCore.ignore_derivatives(y)
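
For reference, a minimal sketch of how this could look inside a Lux loss function. The function name pseudo_label_loss, the cross-entropy line, and the use of OneHotArrays.onehotbatch are illustrative assumptions, not part of the original answer; the key piece is wrapping the label generation in ChainRulesCore.ignore_derivatives so the AD system treats the result as a constant:

using Lux, NNlib, ChainRulesCore
using OneHotArrays  # assumption: onehotbatch is only used for the one-hot encoding

# Sketch: generate pseudo-labels from the model output without gradients
# flowing back through them (the analogue of y.detach() in PyTorch).
function pseudo_label_loss(model, ps, st, x)
    y, st = model(x, ps, st)                  # forward pass (differentiable)

    # Everything inside this block is invisible to the AD system (e.g. Zygote),
    # so no gradients are tracked during label generation.
    labels_true = ChainRulesCore.ignore_derivatives() do
        softmax_y = softmax(y; dims = 1)      # classes along dim 1 (column-major layout)
        pred_class_indices = [argmax(c) for c in eachcol(softmax_y)]
        num_classes = 4
        onehotbatch(pred_class_indices, 1:num_classes)   # one-hot pseudo-labels
    end

    # Differentiable part of the loss, using the detached labels as constants.
    loss = -sum(labels_true .* logsoftmax(y; dims = 1)) / size(y, 2)
    return loss, st
end

ignore_derivatives can also be applied directly to a value, e.g. y1 = ChainRulesCore.ignore_derivatives(y) as in the answer above, or used via the ChainRulesCore.@ignore_derivatives macro; the do-block form just makes explicit that the whole label-generation step is excluded from differentiation.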

zaoanhh (Author) commented Dec 19, 2024

> You do ChainRulesCore.ignore_derivatives(y)

Thank you!
