I created a hive table with bloom filters on 4 different columns and decided later to add a few more using the alter command.
But I am not sure how to refresh/regenerate the bloom filter on Hive.
Is the bloom filter created during insertion of data?
Is it created when we gather stats? Column or table level?
Or am I completely off on my understanding of bloom filters and it is created on the fly?
I have read the documentation and havent found more information about this. Tried going through the code with no luck and finding where the methods are triggered.
You can do this Hive 0.10.0 and later using ANALYZE TABLE command update STATISTICS .
ie.
Note: bloom filter are create while inserting data