STAGE REPLICATION IN A CLOUD DATA LAKE
    Invention publication

    Publication number: US20230214405A1

    Publication date: 2023-07-06

    Application number: US18119775

    Filing date: 2023-03-09

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/27 G06F16/9566 G06F16/254

    Abstract: The embodiments described herein provide means for replicating external stages between deployments of, e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with a set of storage locations, wherein the storage integration comprises a base URL for each of the set of storage locations and wherein each storage location identifies a remote deployment where a cloud platform is hosted and a geographic region of the remote deployment. An external stage object may be bound to the storage integration, wherein the external stage object facilitates a data loading operation that is currently in progress on the first storage location. In response to detecting an outage at the first storage location, the data loading operation that was in progress on the first storage location may be continued at the second storage location using the storage integration.

    Stage replication in a cloud data lake

    Publication number: US11620307B2

    Publication date: 2023-04-04

    Application number: US17396576

    Filing date: 2021-08-06

    Applicant: Snowflake Inc.

    Abstract: Described herein are techniques for replicating external stages between deployments of, e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path, which together may serve to synchronize data loading operations between deployments when, e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.
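
The combination of per-location base file paths and a shared relative file path can be illustrated with a minimal Python sketch. This is not the patented implementation: the `StorageIntegration` class, its method names, and the example bucket URLs are all hypothetical, chosen only to show how the same relative path might resolve under a different base path after a fail-over.

```python
from dataclasses import dataclass


@dataclass
class StorageIntegration:
    """Hypothetical model of a storage integration with several locations."""
    base_paths: dict[str, str]  # location name -> base file path (assumed layout)
    active: str                 # name of the currently active location

    def resolve(self, relative_path: str) -> str:
        """Join the active location's base path with a relative file path."""
        base = self.base_paths[self.active].rstrip("/")
        return base + "/" + relative_path.lstrip("/")

    def fail_over(self, new_active: str) -> None:
        """Switch the active location; relative paths are left unchanged."""
        if new_active not in self.base_paths:
            raise KeyError(f"unknown storage location: {new_active}")
        self.active = new_active


# Example bucket names are placeholders, not values from the patent.
integration = StorageIntegration(
    base_paths={
        "primary": "s3://example-bucket-us-west/lake/",
        "secondary": "s3://example-bucket-us-east/lake/",
    },
    active="primary",
)
before = integration.resolve("events/2021/part-0001.csv")
integration.fail_over("secondary")
after = integration.resolve("events/2021/part-0001.csv")
```

Because a stage in this sketch stores only the relative path, a load that was reading `events/2021/part-0001.csv` from the primary location can look for the same file under the secondary base path once the active location changes.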

    Population of file-catalog table for file stage

    Publication number: US11494438B2

    Publication date: 2022-11-08

    Application number: US17645415

    Filing date: 2021-12-21

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for population of a file-catalog table for a file stage in a user account on a data platform. In an embodiment, a data platform receives, from a client associated with a user account, a request to populate a file-catalog table of the user account based on a plurality of files stored in a file stage of the user account. The data platform responsively executes a list-files table function with respect to the file stage to generate a database-table object having a row for each file stored in the file stage. The data platform populates the file-catalog table of the user account based on the database-table object generated by the list-files table function.
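
The flow in this abstract (list the files in a stage as rows, then populate a catalog table from those rows) can be sketched in Python under loud assumptions: a local directory stands in for the file stage, an in-memory SQLite table stands in for the file-catalog table, and `list_files` and `populate_catalog` are hypothetical helper names, not functions of any data platform.

```python
import os
import sqlite3
import tempfile


def list_files(stage_dir):
    """Yield one (name, size_bytes) row per file in the stand-in stage."""
    for name in sorted(os.listdir(stage_dir)):
        path = os.path.join(stage_dir, name)
        if os.path.isfile(path):
            yield name, os.path.getsize(path)


def populate_catalog(conn, stage_dir):
    """Rebuild the catalog table from the rows produced by list_files."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS file_catalog "
        "(name TEXT PRIMARY KEY, size_bytes INTEGER)"
    )
    conn.execute("DELETE FROM file_catalog")
    conn.executemany("INSERT INTO file_catalog VALUES (?, ?)", list_files(stage_dir))
    conn.commit()


# Build a throwaway stage with two small files, then populate the catalog.
with tempfile.TemporaryDirectory() as stage:
    for name, body in [("a.csv", b"1,2\n"), ("b.csv", b"3,4,5\n")]:
        with open(os.path.join(stage, name), "wb") as fh:
            fh.write(body)
    conn = sqlite3.connect(":memory:")
    populate_catalog(conn, stage)
    rows = conn.execute(
        "SELECT name, size_bytes FROM file_catalog ORDER BY name"
    ).fetchall()
```

The sketch rebuilds the table from scratch on every request, which is the simplest reading of "populate"; the abstract does not say whether the platform replaces or merges existing rows.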

    FILE-CATALOG TABLE FOR FILE STAGE

    Publication number: US20210406310A1

    Publication date: 2021-12-30

    Application number: US17244349

    Filing date: 2021-04-29

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for implementing a file-catalog table for a file stage. In an embodiment, a data platform receives, from a client, a request that the data platform create a file-catalog table for a file stage of a user account associated with the client, where the file stage contains one or more files at a storage location. The user account includes a file-stage object that corresponds to the file stage and that includes an indication of the storage location. In response to receiving the request, the data platform creates the file-catalog table for the file stage. The file-catalog table lists the one or more files in the file stage, and includes at least two different metadata properties of the files listed in the file-catalog table.

    SYNCHRONIZING FILE-CATALOG TABLE WITH FILE STAGE

    Publication number: US20250068676A1

    Publication date: 2025-02-27

    Application number: US18933173

    Filing date: 2024-10-31

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are embodiments of systems and methods for synchronizing a file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
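
Notification-driven synchronization of this kind can be sketched as follows. Everything here is an assumption made for illustration: the event shape (`kind`, `path`, `size_bytes`, `last_modified`), the `apply_notification` helper, and the use of a plain dictionary as the catalog (one entry per file, one key per metadata property) do not come from the patent.

```python
def apply_notification(catalog, event):
    """Update the catalog (path -> metadata dict) from one change notification."""
    path = event["path"]
    if event["kind"] == "deleted":
        # Drop the row; ignore deletes for files the catalog never saw.
        catalog.pop(path, None)
    else:
        # "created" and "modified" both upsert the file's metadata row.
        catalog[path] = {
            "size_bytes": event["size_bytes"],
            "last_modified": event["last_modified"],
        }


catalog = {}
events = [
    {"kind": "created", "path": "a.csv", "size_bytes": 10, "last_modified": "2024-10-01"},
    {"kind": "created", "path": "b.csv", "size_bytes": 20, "last_modified": "2024-10-02"},
    {"kind": "modified", "path": "a.csv", "size_bytes": 15, "last_modified": "2024-10-03"},
    {"kind": "deleted", "path": "b.csv"},
]
for event in events:
    apply_notification(catalog, event)
```

After the four events are applied, the catalog holds a single row for `a.csv` with its most recent metadata, so the table keeps one row per file currently in the stage.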
