Score:0

High network egress from AMERICAS to EMEA on GCP compute and AWS EC2

us flag

Setup a 4 node Hadoop cluster (1 master, 3 workers) on both AWS and GCP. However experiencing, quite high Network egress for both platforms. AWS cluster apps: Hadoop, Yarn GCP cluster apps: Hadoop, Yarn, Hive

AWS resulted to a 244.027GB($21.96). This was 'pardoned' after explanation to AWS support. However, no info on the traffic to prevent future occurrence was provided. Hence, since there are no credits on AWS, had to put the cluster down.

GCP: same issue, but at least with credit limits.

Probably related: have received 'potential violation of service' due to DDOS attacks from both AWS and GCP. Recently, received it from GCP while setting up Kerberos on the cluster.

So far:

  1. Configure nodes to talk to each other using internal-ips (previously was external-ips).
  2. Firewall rules only for relevant ports.
  3. Close all UI browser tabs to apps (Hive, HDFS, Yarn) when not in use.
  4. Requested for AWS support for assistance on best practices and info on traffic. Received a lot of links on AWS material mostly on setting up billing alerts (not configuration or troubleshooting).
  5. GCP support very helpful. GCP billing is straightforward. Requested Tech support via chat - pending.

Any help on how to track where traffic is from.

Update: While working on only two of the nodes setting up Kerberos, seems I consumed up $100 of my remaining credits (on egress again) and cannot access my project unless upgrading to a full account.

cn flag
Enable VPC flow logging (at extra cost), then look at that? It'll generate netflow for the VPC and store in S3.
mangohost

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.