SYS-CON MEDIA Authors: Liz McMillan, Elizabeth White, Maria C. Horton, Andy Thurai, Zakia Bouachraoui

Blog Feed Post

Unix To PowerShell - Wc

PowerShell_unix PowerShell is definitely gaining momentum in the windows scripting world but I still hear folks wanting to rely on unix based tools to get their job done.  In this series of posts I’m going to look at converting some of the more popular Unix based tools to PowerShell.


The Unix “wc” (word count) command will print the character, word, and newline counts for each file specified and a total line if more than one file is specified.  This command is useful for quickly scanning a directory for small and large files or to quickly look at a file and determine it’s relative size.  The Get-Content Cmdlet will return the number of characters in the full but not the number of lines and words.  The following script will emulate the behavior of the Unix “wc” command with a few changes in the way parameters are supplied.

   1: #----------------------------------------------------------------
   2: # Wc.ps1
   3: #----------------------------------------------------------------
   4: param
   5: (
   6:   [string]$filespec = $null,
   7:   [bool]$lines = $false,
   8:   [bool]$words = $false,
   9:   [bool]$chars = $false,
  10:   [bool]$longest = $false
  11: );
  13: #----------------------------------------------------------------
  14: # function New-Stat
  15: #----------------------------------------------------------------
  16: function New-Stat()
  17: {
  18:   param([string]$name);
  19:   $stat = "" | select Lines, Words, Chars, Longest, Name;
  20:   $stat.Lines = $stat.Words = $stat.Chars = $stat.Longest = 0;
  21:   $stat.Name = $name;
  22:   $stat;
  23: }
  25: #----------------------------------------------------------------
  26: # function Get-ColumnList
  27: #----------------------------------------------------------------
  28: function Get-ColumnList
  29: {
  30:   param
  31:   (
  32:     [bool]$lines = $false,
  33:     [bool]$words = $false,
  34:     [bool]$chars = $false,
  35:     [bool]$longest = $false
  36:   );
  37:   $cols = $null;
  38:   if ( $lines ) { $cols += @("Lines"); }
  39:   if ( $words ) { $cols += @("Words"); }
  40:   if ( $chars ) { $cols += @("Chars"); }
  41:   if ( $longest ) { $cols += @("Longest"); }
  42:   $cols += @("Name");
  44:   $cols;
  45: }
  47: #----------------------------------------------------------------
  48: # function Do-Wc
  49: #----------------------------------------------------------------
  50: function Do-Wc()
  51: {
  52:   param
  53:   (
  54:     [string]$filespec = $null,
  55:     [bool]$lines = $false,
  56:     [bool]$words = $false,
  57:     [bool]$chars = $false,
  58:     [bool]$longest = $false
  59:   );
  61:   # if no options are given, set them all to true
  62:   if ( !$lines -and !$words -and !$chars -and !$longest )
  63:   {
  64:     $lines = $words = $chars = $longest = $true;
  65:   }
  67:   $stats = $null;
  69:   $stat_total = New-Stat -name "total";
  71:   [System.IO.FileInfo[]]$files = Get-ChildItem $filespec -ErrorAction SilentlyContinue;
  72:   if ( $null -ne $files )
  73:   {
  74:     foreach ($file in $files)
  75:     {
  76:       # new stat object for current file
  77:       $stat = New-Stat -name $file.Name;
  79:       [string[]]$content = Get-Content $file;
  81:       # wc only prints newlines so the last line is omitted
  82:       $stat.Lines = $content.Length -1;
  84:       # iterate through the lines in the file
  85:       for ($i=0; $i -lt $content.Length; $i++)
  86:       {
  87:         $info = $content[$i] | Measure-Object -word -line -char
  88:         $stat.Words += $info.Words;
  89:         $stat.Chars += $info.Characters;
  90:         if ( $info.Characters -gt $stat.Longest ) { $stat.Longest = $info.Characters }
  92:         # increment the count for newline characters (except for last line)
  93:         if ( $i -lt ($content.Length-1) ) { $stat.Chars += 2; }
  94:       }
  96:       $stat_total.Chars += $stat.Chars;
  97:       $stat_total.Lines += $stat.Lines;
  98:       $stat_total.Words += $stat.Words;
 100:       $stats += @($stat);
 101:     }
 102:     $stats += @($stat_total);
 104:     $stats | select (Get-ColumnList -lines $lines -words $words -chars $chars -longest $longest) | ft -AutoSize;
 105:   }
 106:   else
 107:   {
 108:     "No files matching pattern '$filespec' found!";
 109:   }
 110: }
 112: Do-Wc -filespec $filespec -lines $lines -words $words -chars $chars -longest $longest;

*Update – Thanks to twitter user @Jaykus for the tips on the Measure-Object cmdlet.  Although it didn’t offer a complete replacement, it did make the script faster than the character iteration for each line.

*Update 2 – Added support for longest line count as well.

Read the original blog entry...

More Stories By Joe Pruitt

Joe Pruitt is a Principal Strategic Architect at F5 Networks working with Network and Software Architects to allow them to build network intelligence into their applications.

Latest Stories
The Software Defined Data Center (SDDC), which enables organizations to seamlessly run in a hybrid cloud model (public + private cloud), is here to stay. IDC estimates that the software-defined networking market will be valued at $3.7 billion by 2016. Security is a key component and benefit of the SDDC, and offers an opportunity to build security 'from the ground up' and weave it into the environment from day one. In his session at 16th Cloud Expo, Reuven Harrison, CTO and Co-Founder of Tufin, ...
Historically, some banking activities such as trading have been relying heavily on analytics and cutting edge algorithmic tools. The coming of age of powerful data analytics solutions combined with the development of intelligent algorithms have created new opportunities for financial institutions. In his session at 20th Cloud Expo, Sebastien Meunier, Head of Digital for North America at Chappuis Halder & Co., discussed how these tools can be leveraged to develop a lasting competitive advantage ...
While the focus and objectives of IoT initiatives are many and diverse, they all share a few common attributes, and one of those is the network. Commonly, that network includes the Internet, over which there isn't any real control for performance and availability. Or is there? The current state of the art for Big Data analytics, as applied to network telemetry, offers new opportunities for improving and assuring operational integrity. In his session at @ThingsExpo, Jim Frey, Vice President of S...
"We were founded in 2003 and the way we were founded was about good backup and good disaster recovery for our clients, and for the last 20 years we've been pretty consistent with that," noted Marc Malafronte, Territory Manager at StorageCraft, in this interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm. In their Day 3 Keynote at 20th Cloud Expo, Chris Brown, a Solutions Marketing Manager at Nutanix, and Mark Lav...
In his keynote at 18th Cloud Expo, Andrew Keys, Co-Founder of ConsenSys Enterprise, provided an overview of the evolution of the Internet and the Database and the future of their combination – the Blockchain. Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settl...
"At the keynote this morning we spoke about the value proposition of Nutanix, of having a DevOps culture and a mindset, and the business outcomes of achieving agility and scale, which everybody here is trying to accomplish," noted Mark Lavi, DevOps Solution Architect at Nutanix, in this interview at @DevOpsSummit at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
According to the IDC InfoBrief, Sponsored by Nutanix, “Surviving and Thriving in a Multi-cloud World,” multicloud deployments are now the norm for enterprise organizations – less than 30% of customers report using single cloud environments. Most customers leverage different cloud platforms across multiple service providers. The interoperability of data and applications between these varied cloud environments is growing in importance and yet access to hybrid cloud capabilities where a single appl...
@CloudEXPO and @ExpoDX, two of the most influential technology events in the world, have hosted hundreds of sponsors and exhibitors since our launch 10 years ago. @CloudEXPO and @ExpoDX New York and Silicon Valley provide a full year of face-to-face marketing opportunities for your company. Each sponsorship and exhibit package comes with pre and post-show marketing programs. By sponsoring and exhibiting in New York and Silicon Valley, you reach a full complement of decision makers and buyers in ...
In today's always-on world, customer expectations have changed. Competitive differentiation is delivered through rapid software innovations, the ability to respond to issues quickly and by releasing high-quality code with minimal interruptions. DevOps isn't some far off goal; it's methodologies and practices are a response to this demand. The demand to go faster. The demand for more uptime. The demand to innovate. In this keynote, we will cover the Nutanix Developer Stack. Built from the foundat...
"Cloud computing is certainly changing how people consume storage, how they use it, and what they use it for. It's also making people rethink how they architect their environment," stated Brad Winett, Senior Technologist for DDN Storage, in this interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
Sold by Nutanix, Nutanix Mine with Veeam can be deployed in minutes and simplifies the full lifecycle of data backup operations, including on-going management, scaling and troubleshooting. The offering combines highly-efficient storage working in concert with Veeam Backup and Replication, helping customers achieve comprehensive data protection for all their workloads — virtual, physical and private cloud —to meet increasing business demands for uptime and productivity.
Two weeks ago (November 3-5), I attended the Cloud Expo Silicon Valley as a speaker, where I presented on the security and privacy due diligence requirements for cloud solutions. Cloud security is a topical issue for every CIO, CISO, and technology buyer. Decision-makers are always looking for insights on how to mitigate the security risks of implementing and using cloud solutions. Based on the presentation topics covered at the conference, as well as the general discussions heard between sessio...
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
A look across the tech landscape at the disruptive technologies that are increasing in prominence and speculate as to which will be most impactful for communications – namely, AI and Cloud Computing. In his session at 20th Cloud Expo, Curtis Peterson, VP of Operations at RingCentral, highlighted the current challenges of these transformative technologies and shared strategies for preparing your organization for these changes. This “view from the top” outlined the latest trends and developments i...