Replies: 3 comments 4 replies
-
I'm thinking that we'd need to add a |
Beta Was this translation helpful? Give feedback.
-
Thanks so much Brett for raising this. This is a critical issue we need to resolve as a priority. Your examples highlight a very specific problem. But I think this problem is actually more genenral: any conflict (even just two patches editing the same field) would result in the same kind of problem. We originally wanted to implement patch because in a way it almost has a built-in conflict resolution mechanism (two clients sending two patches updating two different fields of the same resource would not actually generate any conflict, the conflict is "automatically" resolved, but two clients sending two updates would for sure cause conflicts on the server). But what you have highlighted here is that there will be cases where this "auto resolution" causes unforeseen problems (like in your example, you end up updating a different item in a repeated field if someone else changes the index of the item...) @aditya-07 do you have anything to add? |
Beta Was this translation helpful? Give feedback.
-
To avoid data corruption in peer-to-peer sync, it's important to apply patches conditionally based on the resource version. This can be done by using version-specific PATCH operations, preventing inappropriate actions. It would also be beneficial to implement optimistic locking to prevent accidental overwrites. Conflicts are rare, but conflict resolution mechanisms should still be included. Documenting these practices and updating the reference app to include them would help avoid these problems. |
Beta Was this translation helpful? Give feedback.
-
As I've been going through the current implementation of server sync, and thinking about approaches to peer-to-peer sync, I've come across a potential pitfall which I think needs to be highlighted.
The current FHIR engine implementation tracks local changes using JSON Patches, and synchronises these with the server using the FHIR
patch
interaction.I've written up the following scenario to try and illustrate the issue:
When starting with this example patient:
If one client changes the patient's work phone number it results in the following patch:
And if a different client removes the patient's home phone number it results in the following patch:
Then there are two possible outcomes depending on the order in which the patches are synced to the server or from peer to peer.
If they are synced in the order listed above, then the result is what we'd expect:
(The home phone number has been removed, and the work phone number has been updated.)
But if they're synced in the opposite order, then we end up with an invalid result:
(The home phone number has been removed, but the value of the email address was updated instead of that of the work phone number.)
It does not matter when each of the changes was made, only the order in which they were synced.
To avoid ending up with resources in unexpected states, patches need to be applied conditionally based on the version of the resource which each client was making changes to. This is pointed out in the FHIR docs:
Doing this, however, introduces the possibility of conflicts, and the need for conflict resolution. It also negates some of the benefits I'd perceived in using PATCH over PUT.
I've read through the discussion in #472 relating to conflicts where the point was made that conflicts are rare and can be dealt with operationally. I agree with the decision there, but as pointed out above - without special consideration by implementers, the current API allows for implementations which can result in data corruption rather than conflicts. At the moment neither the reference app nor the other implementations which I've looked at take this behaviour into consideration. So I see this as a bit of a footgun, and I'm wondering if there's something which can be done to guide implementers into the pit of success.
Is there a good place to document this? And should the reference app be updated to apply optimistic locking as a good practice?
Any other thoughts are welcome too since this is something which needs to be considered in the implementation of P2P sync.
Beta Was this translation helpful? Give feedback.
All reactions