I have an incremental dataset and would like to remove the last transaction. Below I attached a screenshot and added a border to the one I like removed. I want to remove it while preserving the dataset's "incrementality".
How do I remove the latest transaction in a Foundry incremental build/transform?
480 views Asked by Kevin Zhang At
1
There are 1 answers
Related Questions in PALANTIR-FOUNDRY
- Has the hubble:icon Type Class been deprecated?
- Quiver pivot table from linked Object sets
- Manipulate GeoJSON Data via Typescript Function in Palantir Foundry
- Rest API: Is there any endpoint that provides dataset lineage info?
- Create charts in workshop based on two objects
- Using a Code Repo to call a Webhook created in Palantir Foundry
- How do you manage static data in Palantir Foundry?
- How do I get values from Palantir Foundry Workshop app into a PDF?
- What object types are avaliable for export on <hostname>/phonograph2-export/api/?
- Run Docker Containers on an Existing Kubernetes Cluster on Palantir Foundry
- Delete all rows with duplicate values in column in Palantir Foundry Countor
- Use of ngraph.path library in Palantir - Foundry
- Generate PDF files using transforms api in code repositories and save to foundry
- Palantir Foundry - Making http request and capturing JSON response
- How to publish a Spark ML pyspark.ml.PipelineModel object in code repositories?
Related Questions in FOUNDRY-CODE-REPOSITORIES
- Generate PDF files using transforms api in code repositories and save to foundry
- How to upload large unstructured dataset into a MediaSet in Palantir Foundry?
- How to filter out specific rows of dataframe in pyspark?
- How to use Broadcast in Foundry code repository
- Python Inner Join Returns No Rows but Contour Does
- Load h5 file with keras in Foundry code repository
- How to convert a Dataset of PDFs to a Media Set?
- Is there any way in foundry by which we can validate the attachment while uploading it to ontology object type using attachment property?
- Pass a whole dataset contains multiple files to HuggingFace function in Palantir
- How to add a column as a file name to a parsed dataset in Palantir Foundry?
- How to revert/roll back to an earlier commit in Foundry Code Repo
- How do I ensure a build has consistent provenance records so it can run incrementally?
- check if rows are already present in pyspark dataset
- Palantir Foundry fail to set link between two objects
- How do I update part of a dataset without doing a snapshot build of the whole dataset?
Related Questions in PALANTIR-FOUNDRY-API
- Rest API: Is there any endpoint that provides dataset lineage info?
- Using a Code Repo to call a Webhook created in Palantir Foundry
- Run Docker Containers on an Existing Kubernetes Cluster on Palantir Foundry
- Generate PDF files using transforms api in code repositories and save to foundry
- ConjureHTTPError -- How to resolve?
- Accessing Schedule Information through Foundry's API
- Palantir Foundry - can we add object attachments in python transform?
- How can I replace the path value with the parameter passed?
- Palantir Foundry: Composite-Key for DataSet-Rows / Object-Instance Primary Key
- How to add a column as a file name to a parsed dataset in Palantir Foundry?
- Palantir Foundry - How to load PDF files from Compass folder into code repository transform
- Palantir Foundry - Data Connection to copy PDF files from Compass Folder to external source?
- XLSX file extraction in Palantir Foundry
- How can I use the Palantir Foundry REST API to get a list of datasets within a directory
- How do I migrate a java API to Foundry?
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)

Yes, it is possible to delete one or several transactions in your current dataset which is incrementally built without breaking its incrementality.
The only way to delete a transaction is to use Foundry API calls. If you are not familiar with APIs, please find here the guidelines and we would strongly recommend you trying instructions on a test dataset first until you are comfortable with the process.
The options available depend on your downstream datasets:
SCENARIO 1: Your downstream datasets are running incrementally
You can roll-back your dataset to the latest successful transaction by using the API in foundry's Catalog API "updateBranch2" (branchesUpdate2) please find additional information in this StackOverflow Thread:
The result is that your downstream datasets will continue to run incrementally.
SCENARIO 2: If your downstream datasets are NOT running incrementally
You can remove specific files.
Therefore, for deleting specific files, you will have to use the following steps:
Use create transaction with a transaction type of DELETE
<DATASET_RID> you can find the Dataset RID in your URL.
ex. ri.foundry.main.dataset.c26f11c8-cdb3-4f44-9f5d-9816ea1c82da
Add files to Delete Transaction by listing and opening the logical paths of the files to delete
You can get the filepaths from the dataset Details tab under Files
ex: spark/part-00000-d5e90287-22bd-4840-a6a0-6eb1d98d0af3-c000.snappy.parquet
<TRANSACTION_RID> the has been sent as a response body of the first API call
Commit your transaction
curl -X POST
-H "Content-type: application/json"
-H "Authorization: Bearer $TOKEN"
"https://$HOSTNAME/api/v1/datasets/$DATASET_RID/transactions/$TRANSACTION_RID/commit“
At any time, you can abortTransaction or get the files currently in your transaction with getFilesInTransactionPaged2.
Committing a DELETE transaction does not delete the underlying file from the backing file system—it simply removes the file reference from the dataset view.
DELETE transactions are breaking incrementality. Therefore, if this dataset is used on downstream incremental datasets, this action will break incrementality of their builds.