Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calico node failed to start due to no such file or directory name="/usr/local/bin/felix-plugins/felix-api-proxy" #476

Open
dylan2intel opened this issue Dec 23, 2022 · 11 comments

Comments

@dylan2intel
Copy link

dylan2intel commented Dec 23, 2022

Environment

  • Calico/VPP version: both v3.23 and v3.24
  • Kubernetes version: 1.24
  • Deployment type: bare-metal...
  • Network configuration: Calico /VXLAN enabled

Issue description
2022-12-23 07:54:43.193 [WARNING][32] tunnel-ip-allocator/param_types.go 294: Path lookup failed error=exec: "/usr/local/bin/felix-plugins/felix-api-proxy": stat /usr/local/bin/felix-plugins/felix-api-proxy: no such file or directory name="/usr/local/bin/felix-plugins/felix-api-proxy"
2022-12-23 07:54:43.193 [ERROR][32] tunnel-ip-allocator/param_types.go 299: Executable missing
2022-12-23 07:54:43.193 [ERROR][32] tunnel-ip-allocator/config_params.go 559: Invalid (required) config value. error=Failed to parse config parameter DataplaneDriver; value "/usr/local/bin/felix-plugins/felix-api-proxy": missing file source=environment variable
2022-12-23 07:54:43.193 [PANIC][32] tunnel-ip-allocator/allocateip.go 776: Failed to parse Felix environments error=Failed to parse config parameter DataplaneDriver; value "/usr/local/bin/felix-plugins/felix-api-proxy": missing file
panic: (*logrus.Entry) 0xc00041f650

goroutine 1 [running]:
github.com/sirupsen/logrus.(*Entry).log(0xc0010ffe18, 0x0, {0xc0008fe480, 0x22})
/go/pkg/mod/github.com/sirupsen/[email protected]/entry.go:259 +0x24f
github.com/sirupsen/logrus.(*Entry).Log(0xc00041f5e0, 0x0, {0xc0010ffe18, 0x2a423c0, 0xc000489810})
/go/pkg/mod/github.com/sirupsen/[email protected]/entry.go:293 +0x4f
github.com/sirupsen/logrus.(*Entry).Panic(...)
/go/pkg/mod/github.com/sirupsen/[email protected]/entry.go:331
github.com/projectcalico/calico/node/pkg/allocateip.loadFelixEnvConfig()
/go/src/github.com/projectcalico/calico/node/pkg/allocateip/allocateip.go:776 +0xa5
github.com/projectcalico/calico/node/pkg/allocateip.Run(0xc000914420)
/go/src/github.com/projectcalico/calico/node/pkg/allocateip/allocateip.go:68 +0x7b
main.main()
/go/src/github.com/projectcalico/calico/node/cmd/calico-node/main.go:169 +0x518
Calico node failed to start

To Reproduce
Steps to reproduce the behavior:

Expected behavior
Calico node successfully to start

Additional context
Add any other useful information about the problem or your deployment here.

@sknat
Copy link
Collaborator

sknat commented Jan 5, 2023

Hi @dylan2intel
/usr/local/bin/felix-plugins/ should be mounted in the felix pod by the operator and the felix-api-proxy binary copied in this folder when the calico/vpp agent pod starts.

On startup the calico/vpp agent will wait for the calico/vpp vpp manager to start. What probably happens is something is preventing your VPP to start, the agent waits indefinitely, and never creates the proxy binary

Do you see anything wrong in the output of kubectl -n calico-vpp-dataplane logs calico-vpp-node-XXXX -c vpp ?

@chairwa
Copy link

chairwa commented Nov 22, 2023

problem reason might be that the following cmd was not executed:
kubectl create -f calico-vpp.yaml

@kdhanawade21
Copy link

I am also facing same issue.
Anyone got resolution for this?
Below are the logs.

2024-11-05 09:41:33.558 [INFO][23] tunnel-ip-allocator/param_types.go 289: No executable in my directory myDir="/usr/bin" name="/usr/local/bin/felix-plugins/felix-api-proxy"
2024-11-05 09:41:33.558 [INFO][23] tunnel-ip-allocator/param_types.go 297: Looking for executable on path name="/usr/local/bin/felix-plugins/felix-api-proxy"
2024-11-05 09:41:33.558 [WARNING][23] tunnel-ip-allocator/param_types.go 301: Path lookup failed error=exec: "/usr/local/bin/felix-plugins/felix-api-proxy": stat /usr/local/bin/felix-plugins/felix-api-proxy: no such file or directory name="/usr/local/bin/felix-plugins/felix-api-proxy"
2024-11-05 09:41:33.558 [ERROR][23] tunnel-ip-allocator/param_types.go 306: Executable missing
2024-11-05 09:41:33.558 [ERROR][23] tunnel-ip-allocator/config_params.go 659: Invalid (required) config value. error=Failed to parse config parameter DataplaneDriver; value "/usr/local/bin/felix-plugins/felix-api-proxy": missing file source=environment variable
2024-11-05 09:41:33.558 [PANIC][23] tunnel-ip-allocator/allocateip.go 836: Failed to parse Felix environments error=Failed to parse config parameter DataplaneDriver; value "/usr/local/bin/felix-plugins/felix-api-proxy": missing file
panic: (*logrus.Entry) 0x4000ae12d0

@kdhanawade21
Copy link

Hi @dylan2intel,
Are you able to resolve this issue?
This issue is not there on x86 architecture. I am facing this on ARM architecture.
Please let me know if you are able to resolve this issue.
Thanks in advance.

@onong
Copy link
Collaborator

onong commented Nov 22, 2024

Hi @kdhanawade21, pls go through the comments from sknat and chairwa above and see if that helps.

@kdhanawade21
Copy link

kdhanawade21 commented Nov 26, 2024

Hi @onong,
Thanks for noticing my question.
Your comment helped to narrow down the issue.
I can see that issue is with starting of calico VPP agent.
And failure is because of image architecture.
I am trying to bring up VPP dataplane on ARM architecture.
But on docker hub images for vpp-agent and vpp are AMD compatible.

calicovpp/vpp                             v3.29.0           8d476285473a   2 weeks ago     647MB
calicovpp/agent                           v3.29.0           1f4b0ac01201   2 weeks ago     244MB
root@localhost:~# docker inspect 8d476285473a | grep Arch
        "Architecture": "amd64",
root@localhost:~# docker inspect 1f4b0ac01201 | grep Arch
        "Architecture": "amd64",
root@localhost:~#

Can anyone share path for ARM images for calico vpp-agent and vpp?
Or is there any guide to prepare docker images for vpp-agent and vpp for ARM architecture?

@onong
Copy link
Collaborator

onong commented Nov 28, 2024

Hi @kdhanawade21, pls replace the the images in the Calico/VPP manifest yaml with the following and see how it goes:

image: docker.io/onong/vpp:arm64
image: docker.io/onong/agent:arm64

Disclaimer: arm is not tested at all.

@kdhanawade21
Copy link

Thanks @onong ,
I compiled vpp-dataplane natively on ARM and now able to use ARM images.

@jtollet
Copy link
Collaborator

jtollet commented Jan 21, 2025

That is great!
Question, @kdhanawade21 does Marvell have a DMA that could be used to speed up memif interfaces? Intel has DSA which is now integrated with memif and VCL. This provided interesting performance speedup.

@akamaluddin
Copy link

@jtollet We have multiple hardware acceleration options that can work with memif. However, these have not yet been integrated with memif and VCL. We are currently working on this integration and plan to upstream this support later this year.

@jtollet
Copy link
Collaborator

jtollet commented Jan 21, 2025

Thanks for the update. I am curious to see how it will perform with imix traffic.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants