While Apache tez reuses containers,what is the process takes place. Can anyone explain me clearly?
How container reuse works in Apache Tez? While reusing what is the data stored in shared location?
673 views Asked by Ramprakash At
1
There are 1 answers
Related Questions in APACHE
- Special access rule in an .htaccess file for IP addresses, authorized only for one directory structure
- How to isolate PHP apps from each other on a local machine(Windows or Linux)?
- Cannot load modules/mod_dav_svn.so into server
- How to ignore case in regexp mapping in a .htaccess rewrite rule?
- Oracle Http server ISNT-07551
- I cant access file directory with PHP local host on XAMPP. it just shows one of the files I have in my visual studio code
- Apache Reverse Proxy: only one proxy directive is working. Second one is ignored
- Issue with Django --> Apache WSGI deployment
- changing the node version used by apache web server
- Apache: How can I redirect to a subfolder with a URL param but serve required content via the main URL?
- Why/How does Apache auto-include "DHE" TLS1.2 ciphers while nginx needs "dhparams" file?
- Set up MX records in apache/Ubuntu to point to external mail server
- How to proxy to another port?
- Php can not upload file out of /var/www/html even after disabling Selinux
- Serve static site on S3 + CloudFlare with Apache retaining the source URL
Related Questions in CONTAINERS
- AttributeError: module 'numba' has no attribute 'generated_jit'
- Docker container does not find System Daemon of nordvpn after reboot
- How container isolation is being achieved for windows containers which use process isolation mode?
- Installing dotnet8 on amazonlinux 2023 image through dockerfile
- Mount Azure file share on Azure container app
- Spring Boot application container memory footprint (Java 21)
- Error initializing a docker container after installing transmission-daemon and nordvpn
- How to ping IPv6 address of link-local from container attached to bridge network
- C++ ordered map optimized with index access
- How to run a script after mysql container initialization?
- Azure Form Intelligence Connected Container performance
- Attaching a debugger to a container/instance running on a K8s Pod?
- Connecting Azure container app Spring boot backend to Azure container app Neo4j database
- Visual Studio 2022 free certificate problem. "cannot import key file " how to fix
- Open VS code from terminal in devcontainer?
Related Questions in TEZ
- Preemption with Tez along with the yarn FairShare scheduler supported?
- Way to get current editor id (doc id) in HiveServerClient in Hue source code
- Hive’s dynamic partitioning failing to write final files
- Hive complex data type querying
- Failed Vertex re-running
- How many number of reducers will be running if I use distinct on all columns in hive?
- Poor performance on hash joins with Pig on Tez
- Hive - Select count(*) not working with Tez with but works with MR
- Hive Vertex failed, vertexName=Map
- Hive + Tez :: A join query stuck at last 2 mappers for a long time
- hive llap - which execution engine supported? spark,mr, tez
- Hive - How to know which execution engine I am currently using
- Hadoop Hive TEZ task fails (to start container)
- Pig script on aws emr with tez occasionally fails with OutOfMemoryException
- OOM in tez/hive
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Please read Hortonworks (most significant Tez contributor) docs here: https://hortonworks.com/blog/re-using-containers-in-apache-tez/
Each vertex in Tez specifies parameters, which are used when launching containers. These include the requested resources (memory, CPU etc), YARN LocalResources, the environment, and the command line options for tasks belonging to this Vertex. When a container is first launched, it is launched for a specific task and uses the parameters specified for the task (or vertex) – this then becomes the container’s signature. An already running container is considered to be compatible for another task when the running container’s signature is a superset of what the task requires.
The Tez scheduler works with several parameters to take decisions on task assignments – task-locality requirements, compatibility of containers as described above, total available resources on the cluster, and the priority of pending task requests.
When a task completes, and the container running the task becomes available for re-use – a task may not be assigned to it immediately – as tasks may not exist, for which the data is local to the container’s node. The Tez scheduler first makes an attempt to find a task for which the data would be local for the container. If no such task exists, the scheduler holds on to the container for a specific time, before actually allocating any pending tasks to this container.
Each Tez JVM (or container) contains an object cache, which can be used to share data between different tasks running within the same container. This is a simple Key-Object store, with different levels of visibility/retention. Objects can be cached for use within tasks belonging to the same Vertex, for all tasks within a DAG, and for tasks running across a Tez Session.