feat(records): ingest endpoint by andersfylling · Pull Request #2652 · cognitedata/cognite-sdk-python

andersfylling · 2026-05-27T19:55:56Z

https://cognitedata.atlassian.net/browse/HVD-1261

Created a new PR (again) due to GitHub issues.

gemini-code-assist

Code Review

This pull request adds support for ingesting records into data modeling streams by introducing the ingest method to both the sync and async records API, along with supporting data classes (RecordWrite, RecordSource, RecordSourceReference, and RecordWriteList). Feedback on the changes focuses on adding missing type hints to class attributes across the new data classes to comply with the repository style guide, and safely handling potential null values for sources in RecordWrite._load to prevent a TypeError.

gemini-code-assist · 2026-05-27T19:58:39Z

+    @classmethod
+    def _load(cls, resource: dict[str, Any]) -> Self:
+        return cls(
+            space=resource["space"],
+            external_id=resource["externalId"],
+            sources=[RecordSource._load(s) for s in resource.get("sources", [])],
+        )


If the API response contains sources as null (which is represented as None in Python), resource.get("sources", []) will return None. This will cause a TypeError when attempting to iterate over it. Use resource.get("sources") or [] to safely handle this case.

Suggested change

-    @classmethod
-    def _load(cls, resource: dict[str, Any]) -> Self:
-        return cls(
-            space=resource["space"],
-            external_id=resource["externalId"],
-            sources=[RecordSource._load(s) for s in resource.get("sources", [])],
-        )
+    @classmethod
+    def _load(cls, resource: dict[str, Any]) -> Self:
+        return cls(
+            space=resource["space"],
+            external_id=resource["externalId"],
+            sources=[RecordSource._load(s) for s in resource.get("sources") or []],
+        )
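The difference between the two forms can be checked in isolation. The sketch below uses two hypothetical helper functions (not part of the SDK) and plain dicts standing in for API responses:

```python
from __future__ import annotations

from typing import Any


def sources_with_default(resource: dict[str, Any]) -> list[Any] | None:
    # The default is only used when the key is absent,
    # so an explicit null (None) is returned unchanged.
    return resource.get("sources", [])


def sources_with_or(resource: dict[str, Any]) -> list[Any]:
    # "or []" replaces any falsy value (None, missing key) with an empty list.
    return resource.get("sources") or []


null_payload = {"space": "s", "externalId": "x", "sources": None}
missing_payload = {"space": "s", "externalId": "x"}

default_result = sources_with_default(null_payload)
safe_null = sources_with_or(null_payload)
safe_missing = sources_with_or(missing_payload)
```

Iterating over `default_result` would raise a TypeError, while both `safe_null` and `safe_missing` are empty lists.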

gemini-code-assist · 2026-05-27T19:58:39Z

+class RecordWriteList(CogniteResourceList[RecordWrite]):
+    """A list of :class:`RecordWrite` objects."""
+
+    _RESOURCE = RecordWrite


Please add a type hint for the class attribute _RESOURCE to comply with the repository style guide requirement that all class attributes must have type hints.

Suggested change

-class RecordWriteList(CogniteResourceList[RecordWrite]):
-    """A list of :class:`RecordWrite` objects."""
-
-    _RESOURCE = RecordWrite
+class RecordWriteList(CogniteResourceList[RecordWrite]):
+    """A list of :class:`RecordWrite` objects."""
+
+    _RESOURCE: type[RecordWrite] = RecordWrite

References

All functions, methods, and class attributes must have type hints.

haakonvt · 2026-05-27T23:51:25Z

            resource_path=self._records_url(stream_id),
        )
+
+    async def ingest(self, stream_id: str, items: RecordWrite | Sequence[RecordWrite]) -> None:


I suggest we adopt the "items as posarg, stream_id as kwarg" pattern here too:

Suggested change

-    async def ingest(self, stream_id: str, items: RecordWrite | Sequence[RecordWrite]) -> None:
+    async def ingest(
+        self,
+        items: RecordWrite | Sequence[RecordWrite],
+        *,
+        stream_id: str,
+    ) -> None:
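To illustrate what the bare `*` changes for callers, here is a minimal sketch. `RecordWrite` and `RecordsAPI` below are hypothetical stand-ins, not the SDK classes, and the method body only records the call:

```python
from __future__ import annotations

import asyncio
from collections.abc import Sequence
from dataclasses import dataclass


@dataclass
class RecordWrite:  # hypothetical stand-in for the SDK class
    space: str
    external_id: str


class RecordsAPI:  # hypothetical stand-in for the async records API
    def __init__(self) -> None:
        self.calls: list[tuple[str, int]] = []

    async def ingest(self, items: RecordWrite | Sequence[RecordWrite], *, stream_id: str) -> None:
        # Everything after the bare "*" must be passed by keyword.
        item_list = [items] if isinstance(items, RecordWrite) else items
        self.calls.append((stream_id, len(item_list)))


api = RecordsAPI()
record = RecordWrite(space="my-space", external_id="rec-1")

# Items first, stream_id by name.
asyncio.run(api.ingest(record, stream_id="my-stream"))

# Passing stream_id positionally fails at call time, before any coroutine runs.
positional_error: Exception | None = None
try:
    asyncio.run(api.ingest(record, "my-stream"))  # type: ignore[misc]
except TypeError as exc:
    positional_error = exc
```

With this signature the two arguments cannot be swapped by accident at the call site.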

haakonvt · 2026-05-27T23:56:23Z

+        assert list_cls is not None
+        assert resource_cls is not None
+        assert input_resource_cls is not None


mypy you silly 😆
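For context, asserts like these appear to be there only to narrow optional types for the type checker. A minimal sketch of that narrowing, using a hypothetical function and parameter rather than the SDK's actual code:

```python
from __future__ import annotations


def make_list(list_cls: type[list] | None = None) -> list:
    # Without the assert, mypy would flag the call below,
    # because list_cls may still be None at this point.
    assert list_cls is not None
    return list_cls()


empty = make_list(list)
```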

haakonvt · 2026-05-28T00:07:08Z

+        from cognite.client.utils._identifier import RecordId
+


Suggested change

from cognite.client.utils._identifier import RecordId

haakonvt · 2026-05-28T00:11:58Z

+        from cognite.client.utils._identifier import RecordId
+


Suggested change

from cognite.client.utils._identifier import RecordId

haakonvt · 2026-05-28T00:15:06Z

+        }
+
+
+class RecordWrite(WriteableCogniteResource["RecordWrite"]):


We have used the word "Apply" in Data Modeling, i.e. NodeApply and EdgeApply, but honestly, I like Write better.

haakonvt · 2026-05-28T00:17:14Z

+        return self
+
+
+class RecordWriteList(CogniteResourceList[RecordWrite]):


This is missing an as_ids (not that write-list-type classes ever see any real use, I don't even know why we have them 😆)
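A rough sketch of what such a method could look like. All three classes below are hypothetical stand-ins, and the fields of the SDK's real `RecordId` are an assumption here (space plus external ID, mirroring the `(r.space, r.external_id)` tuples used elsewhere in this PR):

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class RecordId:  # hypothetical stand-in; the real class may differ
    space: str
    external_id: str


@dataclass
class RecordWrite:  # hypothetical stand-in for the SDK class
    space: str
    external_id: str


class RecordWriteList(list):  # stand-in for CogniteResourceList[RecordWrite]
    def as_ids(self) -> list[RecordId]:
        # One identifier per record, in list order.
        return [RecordId(r.space, r.external_id) for r in self]


records = RecordWriteList([RecordWrite("my-space", "a"), RecordWrite("my-space", "b")])
ids = records.as_ids()
```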

haakonvt · 2026-05-28T00:19:12Z

Let's remove all Record* classes from here (cognite/client/data_classes/__init__.py) and keep them in cognite/client/data_classes/data_modeling/__init__.py.

haakonvt · 2026-05-28T00:23:02Z

+                ... )
+        """
+        self._warning.warn()
+        item_list: list[RecordWrite] = [items] if isinstance(items, RecordWrite) else list(items)


Does mypy complain about this? Would be nice to avoid making a full copy

Suggested change

-        item_list: list[RecordWrite] = [items] if isinstance(items, RecordWrite) else list(items)
+        item_list: list[RecordWrite] = [items] if isinstance(items, RecordWrite) else items
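One way to skip the copy while keeping the annotation accurate is to type the local as a `Sequence` rather than a `list`, since a `Sequence[RecordWrite]` is not assignable to `list[RecordWrite]`. A minimal sketch, with a hypothetical helper function and a stand-in `RecordWrite`:

```python
from __future__ import annotations

from collections.abc import Sequence
from dataclasses import dataclass


@dataclass
class RecordWrite:  # hypothetical stand-in for the SDK class
    space: str
    external_id: str


def as_sequence(items: RecordWrite | Sequence[RecordWrite]) -> Sequence[RecordWrite]:
    # A single record is wrapped; an existing sequence is passed through uncopied.
    return [items] if isinstance(items, RecordWrite) else items


batch = [RecordWrite("my-space", "a"), RecordWrite("my-space", "b")]
same_object = as_sequence(batch) is batch
copied_object = list(batch) is batch
wrapped = as_sequence(batch[0])
```

This only works if the code that consumes `item_list` needs nothing beyond what `Sequence` offers (iteration, indexing, `len`).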

haakonvt · 2026-05-28T00:35:13Z

+        return len(self) == len({(r.space, r.external_id) for r in self._identifiers})
+
+
+class RecordSourceReference(CogniteResource):


I believe we can remove this class entirely and use the existing ContainerId

Or make a shallow subclass RecordContainerId

haakonvt · 2026-05-28T00:42:48Z

+        }
+
+
+class RecordSource(CogniteResource):


If RecordSourceReference is replaced with ContainerId, then RecordSource becomes nearly identical to NodeOrEdgeData, but I think we should keep it. The node-or-edge thingy is very bloated due to support for TypedInstance, which for the foreseeable future should not make it into Records.

So I just suggest updating source to ContainerId here.

haakonvt · 2026-05-28T00:48:21Z

One last question, the API has both ingest and upsert; what are your thoughts on keeping just ingest / both? 🤔

ingest endpoint

4287519

andersfylling force-pushed the andersf/records/ingest2 branch from 1636624 to 4287519 Compare May 27, 2026 19:56

andersfylling marked this pull request as ready for review May 27, 2026 19:57

andersfylling requested review from a team as code owners May 27, 2026 19:57

andersfylling changed the title from andersf/records/ingest2 to feat(records): ingest endpoint May 27, 2026

gemini-code-assist Bot reviewed May 27, 2026

haakonvt reviewed May 28, 2026
