Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aborted (core dumped) #16

Open
py-zhai opened this issue Apr 4, 2024 · 3 comments
Open

Aborted (core dumped) #16

py-zhai opened this issue Apr 4, 2024 · 3 comments

Comments

@py-zhai
Copy link

py-zhai commented Apr 4, 2024

您好,python 3.8, torch1.7, dgl 0.7.2, 显卡24G,内存160G, 请问,我在跑这个代码中的new_data.py时,用Movie数据集,只取样50条样本的时候能通过,100条样本就会出现下面报错,是什么原因只能跑这么少的样本呢?大家有没有遇上类似的问题,急求解决,谢谢

terminate called after throwing an instance of 'dmlc::Error'
what(): [11:07:45] /opt/dgl/src/array/cpu/./rowwise_pick.h:89: Check failed: rid < mat.num_rows (2 vs. 1) :
Stack trace:
[bt] (0) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(dmlc::LogMessageFatal::~LogMessageFatal()+0x4f) [0x7f9ab4cb332f]
[bt] (1) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(+0x59b122) [0x7f9ab4d03122]
[bt] (2) /root/miniconda3/lib/python3.8/site-packages/torch/lib/libgomp-7c85b1e2.so.1(GOMP_parallel+0x3f) [0x7f9bba37b01f]
[bt] (3) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(dgl::aten::COOMatrix dgl::aten::impl::CSRRowWisePick(dgl::aten::CSRMatrix, dgl::runtime::NDArray, long, bool, std::function<void (long, long, long, long const*, long const*, long*)>)+0x29a) [0x7f9ab4d0353a]
[bt] (4) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(dgl::aten::COOMatrix dgl::aten::impl::CSRRowWiseTopk<(DLDeviceType)1, long, long>(dgl::aten::CSRMatrix, dgl::runtime::NDArray, long, dgl::runtime::NDArray, bool)+0x133) [0x7f9ab4d0c273]
[bt] (5) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(dgl::aten::CSRRowWiseTopk(dgl::aten::CSRMatrix, dgl::runtime::NDArray, long, dgl::runtime::NDArray, bool)+0x426) [0x7f9ab4c93ef6]
[bt] (6) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(dgl::sampling::SampleNeighborsTopk(std::shared_ptrdgl::BaseHeteroGraph, std::vector<dgl::runtime::NDArray, std::allocatordgl::runtime::NDArray > const&, std::vector<long, std::allocator > const&, dgl::EdgeDir, std::vector<dgl::runtime::NDArray, std::allocatordgl::runtime::NDArray > const&, bool)+0x1364) [0x7f9ab547fc04]
[bt] (7) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(+0xd1bf3a) [0x7f9ab5483f3a]
[bt] (8) /root/miniconda3/lib/python3.8/site-packages/dgl/libdgl.so(+0xd1c694) [0x7f9ab5484694]

Aborted (core dumped)

@139352
Copy link

139352 commented Jul 23, 2024

你好 请问你的cuda版本是多少呀 我运行的时候总是会报错这个Check failed: allow_missing: Device API cuda is not enabled. Please install the cuda version of dgl.

@py-zhai
Copy link
Author

py-zhai commented Jul 23, 2024

torch 1.7.0+cu110 和 torch1.13.1+cu116

@139352
Copy link

139352 commented Jul 24, 2024

好的 谢谢啦

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants