You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Feb 25, 2022. It is now read-only.
On running the main.py script using pre-trained 1.3B model with the --predict flag on, the runtime is stuck for hours after printing Done with copy master to slices., and the predictions are not generated.
To Reproduce
Steps to reproduce the behavior:
Download pre-trained 1.3B model from https://mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/ using wget
Create a file with prompt text sample_prompt.txt
Edit config file at ./GPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/config.json. set "mesh_shape" : "x:1,y:1" (accprding to gpu devices), set model_path to GPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/
From root directory of the repository, run python3 main.py --predict --prompt sample_prompt.txt --gpu_ids 'device:GPU:0' --model "/home/sanchi/GPTNeo/GPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/config.json"
Describe the bug
On running the main.py script using pre-trained 1.3B model with the
--predict
flag on, the runtime is stuck for hours after printingDone with copy master to slices.
, and the predictions are not generated.To Reproduce
Steps to reproduce the behavior:
https://mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/
usingwget
sample_prompt.txt
./GPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/config.json
. set"mesh_shape" : "x:1,y:1"
(accprding to gpu devices), setmodel_path
toGPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/
python3 main.py --predict --prompt sample_prompt.txt --gpu_ids 'device:GPU:0' --model "/home/sanchi/GPTNeo/GPT_1_3B/mystic.the-eye.eu/public/AI/gptneo-release/GPT3_XL/config.json"
Expected behavior
Generate predicted text
Runtime Logs
Environment:
The text was updated successfully, but these errors were encountered: