Scheduling Airflow operators on the same Kubernetes node with autoscalingHow to reduce nodes(vm) running in a Kubernetes cluster of GKE gracefully?GKE cluster autoscaler vs Autoscaling in Managed instance groupsHow does Kubernetes handle preemtible nodes going offline on GKE?kubelet restarting randomlyHow to best run Apache Airflow tasks on a Kubernetes cluster?Scheduling kube-dns on dedicated node poolPrevent Kubernetes rescheduling hundreds of pods when a node is failingAirflow Scheduler out of memory problemsKubernetes Node disconnected from masterkubernetes scheduling for expensive resources

Are unclear "take-it or leave-it" contracts interpreted in my favor?

3D print appears to print very weak walls in long print

Is there any word for "disobedience to God"?

What does "un pareil soin" mean here?

Why does the U.S. tolerate foreign influence from Saudi Arabia and Israel on its domestic policies while not tolerating that from China or Russia?

Is "My neighbor Mr. Lee is 70 years old who plays basketball every day." grammatical?

Why does my String turn into Integers instead of letters after I add characters with +?

Is it worth upgrading to 28mm tyres from 25 for a 250 mile ride?

Is lack of functional requirements agile?

Should disabled buttons give feedback when clicked?

Why do people keep referring to Leia as Princess Leia, even after the destruction of Alderaan?

Find the last element of a list under conditions

Is there a word for a message that is intended to be intercepted by an adversary?

How would my creatures handle groups without a strong concept of numbers?

Matchmaker, Matchmaker, make me a match

Maximum charterer insertion

Are randomly-generated passwords starting with "a" less secure?

Why do players in the past play much longer tournaments than today's top players?

During copyediting, journal disagrees about spelling of paper's main topic

Is Trump personally blocking people on Twitter?

Why are they 'nude photos'?

What would be the ideal melee weapon made of "Phase Metal"?

Is Arc Length always irrational between two rational points?

If the railway suggests a 5-min connection window for changing trains in the Netherlands, does that mean it's definitely doable?



Scheduling Airflow operators on the same Kubernetes node with autoscaling


How to reduce nodes(vm) running in a Kubernetes cluster of GKE gracefully?GKE cluster autoscaler vs Autoscaling in Managed instance groupsHow does Kubernetes handle preemtible nodes going offline on GKE?kubelet restarting randomlyHow to best run Apache Airflow tasks on a Kubernetes cluster?Scheduling kube-dns on dedicated node poolPrevent Kubernetes rescheduling hundreds of pods when a node is failingAirflow Scheduler out of memory problemsKubernetes Node disconnected from masterkubernetes scheduling for expensive resources






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








0















I am using Airflow to compose and schedule long-running batch jobs. I hoped to take advantage of Kubernetes (GKE) cluster autoscaling by using either KubernetesExecutor or KubernetesPodOperator.



I played with GKE node pools and node selectors, and it seems to work as expected by scaling the number of worker nodes up and down to zero. However, some Airflow operators in my old pipeline share data via a file system, which should not work out of the box in Kubernetes as the pods may get scheduled on different nodes.



One solution I can think of is to fuse dependant operators in a single job like a one bash script but this way I lose the benefit of reusing existing Airflow operators. Another approach might be using something like NFS to share a staging directory between multiple workers. I suppose this might have performance and scalability issues.



QUESTION:
Is it possible to direct Kubernetes to schedule pods on the same node so that Airflow operators can share data via the host file system? I tried to use pod affinity for that, but in my case, it didn't help to get operators rescheduled on the same node.










share|improve this question
























  • Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

    – Veerendra
    Mar 26 at 7:05


















0















I am using Airflow to compose and schedule long-running batch jobs. I hoped to take advantage of Kubernetes (GKE) cluster autoscaling by using either KubernetesExecutor or KubernetesPodOperator.



I played with GKE node pools and node selectors, and it seems to work as expected by scaling the number of worker nodes up and down to zero. However, some Airflow operators in my old pipeline share data via a file system, which should not work out of the box in Kubernetes as the pods may get scheduled on different nodes.



One solution I can think of is to fuse dependant operators in a single job like a one bash script but this way I lose the benefit of reusing existing Airflow operators. Another approach might be using something like NFS to share a staging directory between multiple workers. I suppose this might have performance and scalability issues.



QUESTION:
Is it possible to direct Kubernetes to schedule pods on the same node so that Airflow operators can share data via the host file system? I tried to use pod affinity for that, but in my case, it didn't help to get operators rescheduled on the same node.










share|improve this question
























  • Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

    – Veerendra
    Mar 26 at 7:05














0












0








0


1






I am using Airflow to compose and schedule long-running batch jobs. I hoped to take advantage of Kubernetes (GKE) cluster autoscaling by using either KubernetesExecutor or KubernetesPodOperator.



I played with GKE node pools and node selectors, and it seems to work as expected by scaling the number of worker nodes up and down to zero. However, some Airflow operators in my old pipeline share data via a file system, which should not work out of the box in Kubernetes as the pods may get scheduled on different nodes.



One solution I can think of is to fuse dependant operators in a single job like a one bash script but this way I lose the benefit of reusing existing Airflow operators. Another approach might be using something like NFS to share a staging directory between multiple workers. I suppose this might have performance and scalability issues.



QUESTION:
Is it possible to direct Kubernetes to schedule pods on the same node so that Airflow operators can share data via the host file system? I tried to use pod affinity for that, but in my case, it didn't help to get operators rescheduled on the same node.










share|improve this question
















I am using Airflow to compose and schedule long-running batch jobs. I hoped to take advantage of Kubernetes (GKE) cluster autoscaling by using either KubernetesExecutor or KubernetesPodOperator.



I played with GKE node pools and node selectors, and it seems to work as expected by scaling the number of worker nodes up and down to zero. However, some Airflow operators in my old pipeline share data via a file system, which should not work out of the box in Kubernetes as the pods may get scheduled on different nodes.



One solution I can think of is to fuse dependant operators in a single job like a one bash script but this way I lose the benefit of reusing existing Airflow operators. Another approach might be using something like NFS to share a staging directory between multiple workers. I suppose this might have performance and scalability issues.



QUESTION:
Is it possible to direct Kubernetes to schedule pods on the same node so that Airflow operators can share data via the host file system? I tried to use pod affinity for that, but in my case, it didn't help to get operators rescheduled on the same node.







kubernetes airflow google-kubernetes-engine






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 26 at 4:57







kpax

















asked Mar 26 at 3:18









kpaxkpax

1413 silver badges10 bronze badges




1413 silver badges10 bronze badges












  • Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

    – Veerendra
    Mar 26 at 7:05


















  • Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

    – Veerendra
    Mar 26 at 7:05

















Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

– Veerendra
Mar 26 at 7:05






Did you try with antiAffinity/other affinities? banzaicloud.com/blog/k8s-affinities

– Veerendra
Mar 26 at 7:05













0






active

oldest

votes










Your Answer






StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55349352%2fscheduling-airflow-operators-on-the-same-kubernetes-node-with-autoscaling%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes




Is this question similar to what you get asked at work? Learn more about asking and sharing private information with your coworkers using Stack Overflow for Teams.







Is this question similar to what you get asked at work? Learn more about asking and sharing private information with your coworkers using Stack Overflow for Teams.



















draft saved

draft discarded
















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55349352%2fscheduling-airflow-operators-on-the-same-kubernetes-node-with-autoscaling%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

SQL error code 1064 with creating Laravel foreign keysForeign key constraints: When to use ON UPDATE and ON DELETEDropping column with foreign key Laravel error: General error: 1025 Error on renameLaravel SQL Can't create tableLaravel Migration foreign key errorLaravel php artisan migrate:refresh giving a syntax errorSQLSTATE[42S01]: Base table or view already exists or Base table or view already exists: 1050 Tableerror in migrating laravel file to xampp serverSyntax error or access violation: 1064:syntax to use near 'unsigned not null, modelName varchar(191) not null, title varchar(191) not nLaravel cannot create new table field in mysqlLaravel 5.7:Last migration creates table but is not registered in the migration table

용인 삼성생명 블루밍스 목차 통계 역대 감독 선수단 응원단 경기장 같이 보기 외부 링크 둘러보기 메뉴samsungblueminx.comeh선수 명단용인 삼성생명 블루밍스용인 삼성생명 블루밍스ehsamsungblueminx.comeheheheh

155 수학 과학 기타 둘러보기 메뉴eh추가해eh문서를 완성해