element14 Community
element14 Community
    Register Log In
  • Site
  • Search
  • Log In Register
  • About Us
  • Community Hub
    Community Hub
    • What's New on element14
    • Feedback and Support
    • Benefits of Membership
    • Personal Blogs
    • Members Area
    • Achievement Levels
  • Learn
    Learn
    • Ask an Expert
    • eBooks
    • element14 presents
    • Learning Center
    • Tech Spotlight
    • STEM Academy
    • Webinars, Training and Events
    • Learning Groups
  • Technologies
    Technologies
    • 3D Printing
    • FPGA
    • Industrial Automation
    • Internet of Things
    • Power & Energy
    • Sensors
    • Technology Groups
  • Challenges & Projects
    Challenges & Projects
    • Design Challenges
    • element14 presents Projects
    • Project14
    • Arduino Projects
    • Raspberry Pi Projects
    • Project Groups
  • Products
    Products
    • Arduino
    • Avnet Boards Community
    • Dev Tools
    • Manufacturers
    • Multicomp Pro
    • Product Groups
    • Raspberry Pi
    • RoadTests & Reviews
  • Store
    Store
    • Visit Your Store
    • Choose another store...
      • Europe
      •  Austria (German)
      •  Belgium (Dutch, French)
      •  Bulgaria (Bulgarian)
      •  Czech Republic (Czech)
      •  Denmark (Danish)
      •  Estonia (Estonian)
      •  Finland (Finnish)
      •  France (French)
      •  Germany (German)
      •  Hungary (Hungarian)
      •  Ireland
      •  Israel
      •  Italy (Italian)
      •  Latvia (Latvian)
      •  
      •  Lithuania (Lithuanian)
      •  Netherlands (Dutch)
      •  Norway (Norwegian)
      •  Poland (Polish)
      •  Portugal (Portuguese)
      •  Romania (Romanian)
      •  Russia (Russian)
      •  Slovakia (Slovak)
      •  Slovenia (Slovenian)
      •  Spain (Spanish)
      •  Sweden (Swedish)
      •  Switzerland(German, French)
      •  Turkey (Turkish)
      •  United Kingdom
      • Asia Pacific
      •  Australia
      •  China
      •  Hong Kong
      •  India
      •  Korea (Korean)
      •  Malaysia
      •  New Zealand
      •  Philippines
      •  Singapore
      •  Taiwan
      •  Thailand (Thai)
      • Americas
      •  Brazil (Portuguese)
      •  Canada
      •  Mexico (Spanish)
      •  United States
      Can't find the country/region you're looking for? Visit our export site or find a local distributor.
  • Translate
  • Profile
  • Settings
FPGA
  • Technologies
  • More
FPGA
Blog The Art of FPGA Design Season 2 Post 28
  • Blog
  • Forum
  • Documents
  • Quiz
  • Events
  • Polls
  • Files
  • Members
  • Mentions
  • Sub-Groups
  • Tags
  • More
  • Cancel
  • New
Join FPGA to participate - click to join for free!
  • Share
  • More
  • Cancel
Group Actions
  • Group RSS
  • More
  • Cancel
Engagement
  • Author Author: fpgaguru
  • Date Created: 19 Aug 2025 10:39 PM Date Created
  • Views 835 views
  • Likes 7 likes
  • Comments 3 comments
  • xilinx
  • fpgafeatured
  • fpga
  • dsp
  • guest writer
Related
Recommended

The Art of FPGA Design Season 2 Post 28

fpgaguru
fpgaguru
19 Aug 2025

Look Ma, no multipliers!

The last LWDF IIR filter example I gave was unusual in the sense that it was multiplierless, all the coefficients were powers of two, which in FPGA hardware cost absolutely nothing. Of course, this restricts considerably the kind of transfer functions that can be achieved, even when using tricks like cascading multiple instances of the same simpler filter. 

We have seen that the adders and multipliers in the first and second order all pass sections map very well into the FPGA DSP blocks, with their pre-adder, multiplier and post-adder, especially if you can somehow arrange four delays in the feedback loops to achieve full pipelining. Earlier Xilinx FPGA families like 7-Series and UltrScale (but not the latest Versal one) have a very nice feature that lets you do a 3-input adder with the same resources and pipelining requirements of a simple 2-input adder. Named Carry Save Adder, or sometimes 3:2 compressor, it gives you an extra free adder for every normal ripple carry chain binary adder you use.

If the multiplier coefficient is a power of two, or a sum or difference of a small number of powers of two, it is possible to map the entire multiplier, plus the pre and post adders into an equally small number of 3-input adders, while at the same time achieving full pipelining with two register levels instead of the usual four a DSP primitive requires. 

Below are examples of optimal implementations of all pass sections when the coefficient has 1, 2, 3 respectively 4 non-zero bits. An extra degree of freedom can be gained by considering both positive and negative non-zero bits, since subtraction doesn't cost anything extra. For example, 31/64 does not have five non-zero bits, i.e. (16+8+4+2+1)/64, it only has two, (32-1)/64. 

The simplest case we have already encountered is a single non-zero bit, or a power of 2. For example, if c=1/2: 

image

we can replace the two adders and the multiplier with a single 3-input adder, without the need of introducing any extra registers: 

image

For 2, 3 or 4 non-zero coefficient bits we need two register levels for full pipelining. For the 2 non-zero bit case I will use c=31/64 as a numerical example:

image

Now two 3-input adders and some extra FFs are needed: 

image

The 3 non-zero bits case, for example c=25/64: 

image

can be done with three 3-input adders: 

image

Finally, the 4 non-zero bit coefficients like c=43/64: 

image

will require four 3-input adders: 

image

The 2, 3 and 4 non-zero bit cases can be pipelined with just two register levels, so they can achieve sample rates equal to half the clock rate, compared with a quarter clock rate when DSP primitives are used.

So there are significant benefits to these multiplierless schemes but how practical are they? When you design for a certain filter transfer function and then aggressively quantize the coefficients to achieve these 2, 3 or 4 non-zero bit values you get a different filter. Fortunately, LWDF IIRs, similar to FIR filters have almost all their coefficient sensitivity concentrated in the stop band. When you quantize the coefficients the pass band will change very little while the stop band can change a lot. This happens to work very well with the trick of designing for the square root of the filter you actually need and then cascading two such instances. This makes it easier to achieve both very little passband ripple and high stop band attenuation while aggressively quantizing the coefficients. 

Add on top of that the doubling in the maximum achievable sample rate and the savings in DSP primitives utilization and this becomes a very interesting design solution. 

I will end this series on LWDF IIR filters with a final post considering more advanced techniques of pipelining such filters to achieve higher sample rates. 

Back to the top: The Art of FPGA Design Season 2

  • Sign in to reply
  • flyingbean
    flyingbean 26 days ago

    It is enjoyable to read your blogs after a busy day.

    • Cancel
    • Vote Up 0 Vote Down
    • Sign in to reply
    • More
    • Cancel
  • DAB
    DAB 1 month ago

    Great series, thanks.

    • Cancel
    • Vote Up 0 Vote Down
    • Sign in to reply
    • More
    • Cancel
element14 Community

element14 is the first online community specifically for engineers. Connect with your peers and get expert answers to your questions.

  • Members
  • Learn
  • Technologies
  • Challenges & Projects
  • Products
  • Store
  • About Us
  • Feedback & Support
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Legal and Copyright Notices
  • Sitemap
  • Cookies

An Avnet Company © 2025 Premier Farnell Limited. All Rights Reserved.

Premier Farnell Ltd, registered in England and Wales (no 00876412), registered office: Farnell House, Forge Lane, Leeds LS12 2NE.

ICP 备案号 10220084.

Follow element14

  • X
  • Facebook
  • linkedin
  • YouTube