Technical Interview Questions: November 2018

Project management interview questions & answers - Part III

How have you contributed to the success of the project?
By:

Understanding end goals.
Understanding my role and each team's roles.
Identifying Interdependencies and stating them early on.
Cross functional and periodic communication via emails, standups, meetings.
Setting up milestones and periodic checking enabling team to collaborate well.
Accepting and managing problems.
Recognizing and rewarding the teams.

How would you increase efficiency of your development team?
By:

Document Coding and dev standards.
Implement CI integration.
Release often if possible.
Schedule demos.
Follow test driven development.
Creating detailed tasks.
Review backlog
Resolve blockers.
Have short meetings.
Support initiative
Rely and trust your team's expertise, where ever possible.

If you come about an early delay in one of your milestones what would you do?

Catch them early by monitoring the progress of the project and staying in touch with leads through out the project.
Notify stakeholders: Update them on delay and revised schedules.
Call for a meeting: Dev, technical teams, vendors, stakeholders, customers and update them on the delay.
Gather the right resources: Re-allocate the resources.
Reschedule: Check if some activities that were planned sequentially can be done parallely.
Re-prioritize: List out all the activities that are not yet done. Move the important ones to the top of the list.
Document the updated plan and send it out to everyone.

How to decide between traditional project management v/s Agile methologies?
Use traditional when:

Long and detailed planning is required.
Processes is linear and all the tasks are scheduled sequentially.
Requires a formal CM process.
Prioritization is fixed.
Customer feedback can be taken at the end and incorporated into future releases. Customer involvement is low.
Organization is very centralized.
ROI is achieved at product release.
Ex: Hardware or non-customer facing projects like infrastructure or technology changes.

Use Agile when:

Process and decision making is iterative.
Need customer feedback right away and can be incorporate right away.
Organization is de-centralized.
Prioritization changes based on business requirements and customer feedback.
Small amount of work is picked up to be done. Rest can be updated and prioritized based on inputs.
Customer involvement is high.
ROI is achieved often and is iterative. It determines what the future releases.

What are the various states of the project? - RYG. What do they stand for? How to move the project from R to G?
Green: Project is within budget, timeline, and expectation.
Yellow: Project might fallout of budget, timeline or expectation and is it risk. Requires special attention from team involved. If needed from higher ups.
Red: Some aspect of the project has fallen behind or encountered major setback or is over budget
How to move R to G: Have a plan on paper, get a buy in, and get it approved. Once everyone agrees to the new set of parameters then move it to G from R.
Some teams move G to Y or R to get feedback from upper management so that they can get their expertise involved to get it done.

Project management interview questions & answers - Part I

Big Data Basics

NoSql:

Data is highly unstructured.
Doesn't follow stringent structure of RDBMS enabling speed and agility.
DBs are distributed and data can be distributed across multiple nodes and servers.
Allows for horizontal scaling: As the data grows add more nodes without impacting performance.

Big Data:

Big Data refers to large collection of data (that may be structured, unstructured or semi structured) that expands so quickly that it is difficult to manage with regular database or statistical tools.
HDFS does not offer native support for security and authentication.
Cluster has nodes

Hadoop v/s conventional DB:

Hadoop:

Data is distributed across many nodes and processing
Write once, read many. Once you write the data, you can delete it but can't modify it.
Archival data: Telephonic call or transaction data
Doesn't support SQL at all.
It is an ecosystem of tools, technologies, and platforms.
Runs on many commodity H/W and uses commodity S/W.
Supports Hbase that is a NoSQL distributed DB.

Conventional DB:

Conceptually all data sits in one server/database.
Data can be modified.
Support SQL

Hadoop layers:

Bottom layer/Layer 1: Commodity Cluster Hardware
Middle layer/Hadoop Layer/Layer 2: MapReduce, HDFS
Top layer/Tools layer/Layer 3: RHadoop, Mahout, Hive, Pig, HBase, Sqoop
RHadoop: Supports statistical language R
Mahout: Machine learning
Hive/Pig: NoSQL
Sqoop: Getting data into and out of the Hadoop file system

Advantages:
1. Scalable
2. Cost effective in terms of processing large volumes of data.

=== Hive ===

It provides SQL intellect so that users can write such queries called as HQL to extract data from hadoop.
These SQL queries are converted to MapReduce queries. These queries in turn will communicate with HDFS.
Great platform to write SQL writes to interact with HDFS.
Not RDMS, or OLTP or real time updates or queries
Nice features:

Supports different file formats like sequence/text/avro/orc/rc file.
Metadata gets stored in RDBMS
Provides lots of compression techniques.
SQL queries are converted into MapReduce or tez or spark jobs.
UDF can include mapreduce scripts can be plugged
Specalized joins helps improve query function

Hive v/s RDBMS:
Hive:

Enforce schema on Read and not on write. So you can write any kind of data till you read it.
Supports storage of 100PetaBytes of data.
Doesn't support OLTP

RDBMS:

Schema on Write. Won't let insert any data if its out of schema.
Allows storage of around 10PB of data.
Support OLTP

Impala:

It is not mapreduce
It is Massively Parallel Processing engine on top of Hadoop to query and analyze the data sets.
Utilizes Hive metastore to store table structure
With the help of external tables, data resides in the Hadoop file system and structure in the metastore.
Popular for Data scientists and analysts.

Hive v/s Pig v/s Spark
Hive:

Gives non-programmers ability to query and analyze Hadoop DBs
Abstraction layer on top of Hadoop
Batch oriented framework
Useful for structured data.
Users can use SQL like interface to interact with backend Hadoop platform.
Supports:

Batch query processing: For huge datasets.
Interactive query processing: For real time data processing.

Hive queries get converted into MapReduce jobs.
Predefined or UDF(User Defined Functions) can be used to perform certain action.
In hive:

select * will create a fetch job but not map reduce
Aggregation functions like min, max, etc will create a map reduce job.

Pig:

Requires some programming knowledge to query and extract the data.
Abstraction layer on top of Hadoop
Batch oriented framework.
Useful for structured, semi-structured, and unstructured data.
Pig has 2 parts: Pig Latin and Pig runtime.

Pig runtime converts the job from Pig to MapReduce

Popular amongst data engineers

Spark:

In memory processing, you need to know java to utilize spark.
Faster but is low level since it requires coding knowledge.
Useful for structured, semi-structured, and unstructured data.

Decision making between Hive, Pig, Spark

If you have unstructured data then go with Pig or Spark.
If you have structured data then go with Hive and load the data into Hive.
If you want faster processing go with Spark.
If you are fine with waiting few hours then go with Pig or Hive.
If you have technical knowledge then go with Spark -> Pig -> Hive.

Few notes:

Solr: Elastic search tool. Searches for words within documents.
Sqoop is used to import data into Hadoop. Pig is used to process that data.
Hbase: column family NoSql DB

System design cheatsheet

Database scaling

Horizontal scaling is ensured by adding concurrent machines that will handle more requests.
Path1: The requests will be routed to SQL and it will become slow overtime. To make it better add more RAM, use sharding, denormalization, SQL tuning.
Path 2: Better way to handle scale is denormalize right from beginning or switch to scalable no-sql DB. Even after that you'll need to introduce a cache.

Caching

Users will see performance degradation when loads of data is fetched from the DBs. Cache needs to be implemented in such cases.
In-memory cache like Redis or Memcached should be considered and not file based caching.

Data is stored in the RAM.
Redis can do 100s of 1000s of reads/second.
Writes(including incremental ones) are faster too.

Cache sits between storage and application.
2 patterns are:

Cached database queries
Cached objects

A. Cached database queries

Store the query and its result in the cache.
Query is the key and result is value.
Problem: If just a column or row changes, you need to remove all the key-value pairs that reside in the cache. That row or column might be used by a lot of queries and might be present in a lot of results. So its not an ideal approach.

B. Cached objects

Store the class instance so that you can get rid of it if something changes.
If one DB column value has changed then you need to get rid of the relevant object and not complete object.
So its an ideal approach.

What to store in cache:

Sessions
User activity stream like twitter
Fully rendered blog posts
user <-> friend relationships

Types of asynchronism

A. For mostly static data that doesn't require a lot of pre-computation:

Website pages that are built with frameworks or CMS should be pre-rendered and stored on AWS or CDN.
Cron job performs these operations and store/push them on CDNs.
This will make the site super responsive and could handle multiple requests.

B. For dynamic data that requires intensive computation:

User comes to the site and requests an operation to be performed.
Site informs the user that its processing the task and informs the user once the job is done.

When the task comes it is placed in the queue.
Worker process will come and pick up the task from the queue. It will process it.
The worker process finishes the job and informs the Front end about it.
FE receives the signal and update the user.
Technologies used for queuing are: Redis list, RabbitMQ, ActiveMQ

Source:
http://www.lecloud.net/post/7295452622/scalability-for-dummies-part-1-clones
http://www.lecloud.net/post/7994751381/scalability-for-dummies-part-2-database
http://www.lecloud.net/post/9246290032/scalability-for-dummies-part-3-cache
http://www.lecloud.net/post/9699762917/scalability-for-dummies-part-4-asynchronism

Project management interview questions & answers

How would you handle non-productive developers or team-members?

Align big lofty goals with their personal goals. If not, incentivize them by helping them identify the personal goals(like learning a new programming language that will be used in this project that will help her/him get motivated)
Clearly define roles and responsibilities. Assign accountability to them.
Don't be overly strictly. Put some big rules in place but don't keep overemphasizing them.
Include the team in decision making and planning process.
Do retrospective meetings to ensure what has been accomplished and what not.
Talk to them to understand if they need help.

How will you get traction from a TPM of another team?

Communicate the goals to them and what impact they are going to make. Indicate KPIs or metrics that this goals could help achieve.
Align their projects with yours and involve them into planning.
Divide the projects into roadmap and sub-stories. Make them commit to the sub-stories and ask for timelines.
Offer help if they are not able to achieve those timelines.
Glorify them when the projects are achieved within reasonable time limits. This will make them very likely to work with you again.

What are important things of consider while running a cross-team program?

Helps if roles and responsibilities are defined.
Communication is shared. And their are frequent updates. Frequent meetings of leaders, where they provide updates from their teams.
Its best if they know the mission and goals one can achieve. Report frequently on KPIs and metrics.
Encourage cross-functional training.
Celebrating major milestones and congratulating the team on achieving the goals.

How do you earn trust of your team members.

Lead by example
Communicate openly
Don't place blame on any one person. Tackle the issue at hand without pointing fingers.
Discuss trust issues.
Be available to them to discuss project goals and communicate often.
Develop team exercises.
Be calm, open, and transparent where ever possible.
Encourage mutual feedback.
Allow flexibility in choosing projects, hours.
Be patient with new employees.

How do you plan a project or program?

Under the project goals.

Align them with company goals and mission.
Define KPIs and metrics.
Prioritize goals.

Identify stakeholders and meet with them. Discuss project goals with them to see if they help achieve their goals.
Create a product roadmap.
Divide roadmap into deliverables.
Assign deliverables to functional and dev teams. Create a project schedule of the deliverables.
Identify issues or technology gaps and complete risk assessment.

Think of alternatives to avoid or minimize risk.
Think of MVPs in case project runs into delivery issues.

Present the plan to stakeholders.
Communicate the plan to the dev teams and keep things moving.

What in your opinion are three constraints of a project or program?
Its called as Project management triple constraint. Those are:

Time
Cost
Scope
4th: Meets the customers requirements. - Make it PM diamond.

Tell-tale signs that your project is going to fail:
Objectives:

Missing strategy.
No clear goals.
Leadership priority issues.
Too many projects at one time.
Constant scope changes.
Last minute major changes.

Leaders:

Team doesn't trust higher level or management.
Management or key people leaves in between the projects.

People:

Stakeholders are not interested.
No knowledge sharing.
Resource limits

How do you motivate your team of developers?

Provide them flexibility to choose the projects and times they could work on whenever possible.
Involve them in planning and important decision making if possible. Don't ask them to do it because upper management is doing it.
Help them understand what KPIs or metrics you are trying to achieve. And how it will help shape the company.
Career growth: Help them with career growth if possible. Align the career goals to the projects that are pending.
Innovation: Allow them to utilize latest tools and technologies. If they are going to conferences to learn new things let them go. Request a demo to see you are interested what they have been learning. See if you can put sometime in project planning to let them experiment with new technologies.
Create structured trainings in the organizations. Let me take online trainings and reimburse it when needed.
Provide recognition of good work.
Good infrastructure to let them work.
Empower them to make the decisions where ever possible. Don't wait on you for each and everything.