[source-mysql] Sync Stops / Infinite Hanging on UI - Out of Memory because of Zero Dates in Table Column #51014
Labels
area/connectors
Connector related issues
community
connectors/source/mysql
team/db-dw-sources
Backlog for Database and Data Warehouse Sources team
type/bug
Something isn't working
Connector Name
source-mysql
Connector Version
3.9.4
What step the error happened?
During the sync
Relevant information
Issue
Hi!
We are using airbyte to sync alot of different types of sources (MSSQL, MySQL, CSV Files, among others) and we are having an issue using the MySQL Source.
We noticed that some of our tables have "Zero Dates" ('0000-00-00' for example) in them, instead of null. We noticed that the source has the behaviour of changing these invalid dates to NULL when writing the DESTINATION / OUTPUT. <<< This is ideal! It's something that we want to happen.
The issue in question is that, when the SYNC is running, and a table has TOO many Zero Date rows, we noticed that the SYNC log gets bombed with INFO logs about each row and, after a while, the sync "freezes". Stops generating more logs.
We managed to read the log by accessing the "airbyte-abctl-control-plane" container, created by ABCTL, and via "kubectl logs -n airbyte-abctl replication-job-xxxx-attempt-x -f" command, we reached the end of the execution and its displayed that and OUT OF MEMORY has occured and the processed was killed.
After this OOM problem, the only way to "move" past is canceling the SYNC in the airbyte UI.
We got this problem so far from two distinct MySQL database, and in both cases, there were tables with millions or rows that have this kind of Zero Date problem...
We think that some variable / state / cache or something is accumullating when treating these database rows in general, and after a while, its running out of memory in the process that was created...
We didn't got this kind of behaviour when syncing other tables... We even made a test where a connection with 20 tables was not syncing, and after removing the only table with Zero Dates (2 columns ~ Aproximate 1.2 million rows), and running all the other 19, the sync was successfull.
Airbyte Version
1.3.1 - Latest install using ABCTL
Abctl Version
v.0.24.0
MySQL Source Version
3.9.4
Example of INFO generated before the Out of Memory error
Images
I'm sending together with this message, 3 images... 1 showing the pod as "completed", 1 showing the error inside the log, 1 showing the SYNC "hanging" in the airbyte UI after the Out of Memory Error.
Relevant log output
Contribute
The text was updated successfully, but these errors were encountered: